TY - GEN
T1 - Selecting informative genes from microarray dataset by incorporating gene ontology
AU - Xian, Xu
AU - Aidong, Zhang
PY - 2005
Y1 - 2005
N2 - Selecting informative genes from microarray experiments is one of the most important data analysis steps for deciphering biological information imbedded in such experiments. However, due to the characteristics of microarray technology and the underlying biology, namely large number of genes and limited number of samples, the statistical soundness of gene selection algorithm becomes questionable. One major problem is the high false discover rate. Microarray experiment is only one facet of current knowledge of the biological system under study. In this paper, we propose to alleviate this high false discover rate problem by integrating domain knowledge into the gene selection process. Gene Ontology represents a controlled biological vocabulary and a repository of computable biological knowledge. It is shown in the literature that gene ontology-based similarities between genes carry significant information of the functional relationships [3]. Integration of such domain knowledge into gene selection algorithms enables us to remove noisy genes intelligently. We propose an add-on algorithm applied to any single gene-based discriminative scores integrating domain knowledge from gene ontology annotation. Preliminary experiments are performed on publicly available colon cancer dataset [2] to demonstrate the utility of the integration of domain knowledge for the purpose of gene selection. Our experiments show interesting results.
AB - Selecting informative genes from microarray experiments is one of the most important data analysis steps for deciphering biological information imbedded in such experiments. However, due to the characteristics of microarray technology and the underlying biology, namely large number of genes and limited number of samples, the statistical soundness of gene selection algorithm becomes questionable. One major problem is the high false discover rate. Microarray experiment is only one facet of current knowledge of the biological system under study. In this paper, we propose to alleviate this high false discover rate problem by integrating domain knowledge into the gene selection process. Gene Ontology represents a controlled biological vocabulary and a repository of computable biological knowledge. It is shown in the literature that gene ontology-based similarities between genes carry significant information of the functional relationships [3]. Integration of such domain knowledge into gene selection algorithms enables us to remove noisy genes intelligently. We propose an add-on algorithm applied to any single gene-based discriminative scores integrating domain knowledge from gene ontology annotation. Preliminary experiments are performed on publicly available colon cancer dataset [2] to demonstrate the utility of the integration of domain knowledge for the purpose of gene selection. Our experiments show interesting results.
UR - https://www.scopus.com/pages/publications/33751185128
U2 - 10.1109/BIBE.2005.51
DO - 10.1109/BIBE.2005.51
M3 - Conference contribution
AN - SCOPUS:33751185128
SN - 0769524761
SN - 9780769524764
T3 - Proceedings - BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering
SP - 241
EP - 245
BT - Proceedings - BIBE 2005
T2 - BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering
Y2 - 19 October 2005 through 21 October 2005
ER -