TY - GEN
T1 - Mining κ-median chromosome association graphs from a population of heterogeneous cells
AU - Chen, Zihe
AU - Ding, Hu
AU - Chen, Danyang
AU - Wang, Xiangyu
AU - Fritz, Andrew
AU - Sehgal, Nitasha
AU - Berezney, Ronald
AU - Xu, Jinhui
N1 - Publisher Copyright:
Copyright 2015 ACM.
PY - 2015/9/9
Y1 - 2015/9/9
N2 - Finding the structural pattern from a set of objects is a commonly encountered prototype learning problem in machine learning and pattern recognition. In graph domain, such a structure is called median graph. Existing research has demonstrated that computing an accurate median graph could be rather challenging. In this paper, we present a new technique for mining κ-median graphs from a population of heterogeneous cells. Each median graph is a representative structure of chromosome associations of a subset of the cells in the population. Comparing to existing techniques, our technique has several unique advantages. Firstly, it reveals, for the first time, the level of associations (or degree of associations) among the chromosomes. Secondly, it generates multiple median graphs simultaneously, and therefore can be used to handle heterogeneous data. Our technique is based on a number of interesting ideas, such as adaptive sampling, semi-definite programming model, embedding, and local search on uncertain data. Experimental results on both random and biological data sets suggest that our technique yield near optimal solutions.
AB - Finding the structural pattern from a set of objects is a commonly encountered prototype learning problem in machine learning and pattern recognition. In graph domain, such a structure is called median graph. Existing research has demonstrated that computing an accurate median graph could be rather challenging. In this paper, we present a new technique for mining κ-median graphs from a population of heterogeneous cells. Each median graph is a representative structure of chromosome associations of a subset of the cells in the population. Comparing to existing techniques, our technique has several unique advantages. Firstly, it reveals, for the first time, the level of associations (or degree of associations) among the chromosomes. Secondly, it generates multiple median graphs simultaneously, and therefore can be used to handle heterogeneous data. Our technique is based on a number of interesting ideas, such as adaptive sampling, semi-definite programming model, embedding, and local search on uncertain data. Experimental results on both random and biological data sets suggest that our technique yield near optimal solutions.
KW - Cell nucleus
KW - Machine learning
KW - Optimization
UR - https://www.scopus.com/pages/publications/84963632403
U2 - 10.1145/2808719.2808724
DO - 10.1145/2808719.2808724
M3 - Conference contribution
AN - SCOPUS:84963632403
T3 - BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
SP - 47
EP - 56
BT - BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
PB - Association for Computing Machinery, Inc
T2 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, BCB 2015
Y2 - 9 September 2015 through 12 September 2015
ER -