TY - GEN
T1 - Parallel information theory based construction of gene regulatory networks
AU - Zola, Jaroslaw
AU - Aluru, Maneesha
AU - Aluru, Srinivas
PY - 2008
Y1 - 2008
N2 - We present a parallel method for construction of gene regulatory networks from large-scale gene expression data. Our method integrates mutual information, data processing inequality and statistical testing to detect significant dependencies between genes, and efficiently exploits parallelism inherent in such computations. We present a novel method to carry out permutation testing for assessing statistical significance while reducing its computational complexity by a factor of Θ(n 2), where n is the number of genes. Using both synthetic and known regulatory networks, we show that our method produces networks of quality similar to ARACNE, a widely used mutual information based method. We present a parallelization of the algorithm that, for the first time, allows construction of whole genome networks from thousands of microarray experiments using rigorous mutual information based methodology. We report the construction of a 15,147 gene network of the plant Arabidopsis thaliana from 2,996 microarray experiments on a 2,048-CPU Blue Gene/L in 45 minutes, thus addressing a grand challenge problem in the NSF Arabidopsis 2010 initiative.
AB - We present a parallel method for construction of gene regulatory networks from large-scale gene expression data. Our method integrates mutual information, data processing inequality and statistical testing to detect significant dependencies between genes, and efficiently exploits parallelism inherent in such computations. We present a novel method to carry out permutation testing for assessing statistical significance while reducing its computational complexity by a factor of Θ(n 2), where n is the number of genes. Using both synthetic and known regulatory networks, we show that our method produces networks of quality similar to ARACNE, a widely used mutual information based method. We present a parallelization of the algorithm that, for the first time, allows construction of whole genome networks from thousands of microarray experiments using rigorous mutual information based methodology. We report the construction of a 15,147 gene network of the plant Arabidopsis thaliana from 2,996 microarray experiments on a 2,048-CPU Blue Gene/L in 45 minutes, thus addressing a grand challenge problem in the NSF Arabidopsis 2010 initiative.
KW - Gene networks
KW - Mutual information
KW - Parallel computational biology
KW - Systems biology
UR - https://www.scopus.com/pages/publications/58649124319
U2 - 10.1007/978-3-540-89894-8_31
DO - 10.1007/978-3-540-89894-8_31
M3 - Conference contribution
AN - SCOPUS:58649124319
SN - 354089893X
SN - 9783540898931
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 336
EP - 349
BT - High Performance Computing - HiPC 2008 - 15th International Conference, Proceedings
PB - Springer Verlag
T2 - 15th International Conference on High Performance Computing, HiPC 2008
Y2 - 17 December 2008 through 20 December 2008
ER -