TY - GEN
T1 - Finding informative genes from multiple microarray experiments
T2 - 2011 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2011
AU - Ge, Liang
AU - Du, Nan
AU - Zhang, Aidong
PY - 2011
Y1 - 2011
N2 - With the rapid advancement of biology technology, many micro array experiments are conducted towards the same problem of finding informative genes. Therefore, it is important to find a set of informative genes integrating multiple micro array experiments that achieves maximal consensus. Most previous researches formulated this problem as a rank aggregation problem. In this paper, we propose a novel Graph-based Consensus Maximization (GCM) model to estimate the conditional probability of each gene being informative, then the genes are ranked by this probability. The estimation of the probabilities is formulated as an optimization problem on a bipartite graph, where the criterion function favors the smoothness of the prediction over the graph and penalizes deviations from the initial input ranked lists from micro array experiments. We solve this problem through iterative propagation of probability estimates among neighboring nodes. In addition, when certain genes have already been identified to be informative, it has never been explored in the literature how to take advantage of such information to improve the consensus result. Our proposed GCM model can be naturally extended to incorporate such information, thus increasing the quality of the predicted result. In the experimental evaluation, we conducted experiments on the five prostate cancer micro array studies. The results showed that our model outperformed other baseline methods in finding informative genes. Furthermore, by adding only one piece of information that some gene is informative, our model yielded a significantly better result. The experimental evaluation demonstrates that the proposed GCM model is effective and superior in finding informative genes from multiple micro array experiments.
AB - With the rapid advancement of biology technology, many micro array experiments are conducted towards the same problem of finding informative genes. Therefore, it is important to find a set of informative genes integrating multiple micro array experiments that achieves maximal consensus. Most previous researches formulated this problem as a rank aggregation problem. In this paper, we propose a novel Graph-based Consensus Maximization (GCM) model to estimate the conditional probability of each gene being informative, then the genes are ranked by this probability. The estimation of the probabilities is formulated as an optimization problem on a bipartite graph, where the criterion function favors the smoothness of the prediction over the graph and penalizes deviations from the initial input ranked lists from micro array experiments. We solve this problem through iterative propagation of probability estimates among neighboring nodes. In addition, when certain genes have already been identified to be informative, it has never been explored in the literature how to take advantage of such information to improve the consensus result. Our proposed GCM model can be naturally extended to incorporate such information, thus increasing the quality of the predicted result. In the experimental evaluation, we conducted experiments on the five prostate cancer micro array studies. The results showed that our model outperformed other baseline methods in finding informative genes. Furthermore, by adding only one piece of information that some gene is informative, our model yielded a significantly better result. The experimental evaluation demonstrates that the proposed GCM model is effective and superior in finding informative genes from multiple micro array experiments.
UR - https://www.scopus.com/pages/publications/84856063518
U2 - 10.1109/BIBM.2011.34
DO - 10.1109/BIBM.2011.34
M3 - Conference contribution
AN - SCOPUS:84856063518
SN - 9780769545745
T3 - Proceedings - 2011 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2011
SP - 506
EP - 511
BT - Proceedings - 2011 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2011
Y2 - 12 November 2011 through 15 November 2011
ER -