Title | More accurate models for detecting gene-gene interactions from public expression compendia |
Creator | |
Date Issued | 2017-01-17 |
Conference Name | IEEE International Conference on Bioinformatics and Biomedicine (IEEE BIBM) |
Source Publication | Proceedings - 2016 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2016
![]() |
ISBN | 9781509016105 |
ISSN | 2156-1125 |
Pages | 1871-1878 |
Conference Date | DEC 15-18, 2016 |
Conference Place | Shenzhen, PEOPLES R CHINA |
Abstract | The fast accumulation of public gene expression data - made possible by high-throughput technology - provides us an unprecedented opportunity to identify functionally related genes by analyzing their co-expression patterns. However, these data are typically noisy and highly heterogeneous, complicating their use in constructing co-expression network in large expression compendia. Previous studies suggested that the collective gene expression pattern can be better modeled by Gaussian mixtures. This motivates our present work, which proposes Multimodal Mutual Information (MMI) to reconstruct gene co-expression network from public gene expression data. MMI assumes gene pair following bivariate Gaussian mixture models and categories the samples into unique bins with respect to their expression magnitude. Two kinds of correlations in MMI are computed and aggregated to capture both discretized dependency and the expression correlation for each bin. Through extensive simulations, MMI outperforms other approaches with respect to calculating gene-gene interactions, regardless of the level of noise or strength of interactions. The advance of MMI is further validated by three real problems: 1. Infer novel gene functions by their connections with the well-documented genes. We apply principle component analysis to the correlated matrix generated by MMI and Pearson correlation and construct transcriptional components to evaluate gene-gene interactions. MMI enables 1.7 times more eligible transcriptional components than Pearson correlation, which can be used to predict gene functions. 2. Prioritize candidate genes for an affected pedigree. MMI calculates the interactions between candidate and disease established genes and explores KIF1A as the new causal gene of pure hereditary spastic paraparesis. 3. Detect disease 'hot genes'. MMI identifies ANK2 as the 'hot gene' for autism spectrum disorders by evaluating its co-expression with other disease susceptible genes derived from trio-based exome sequencing data. |
DOI | 10.1109/BIBM.2016.7822804 |
URL | View source |
Indexed By | CPCI-S |
Language | 英语English |
WOS Research Area | Computer Science ; Medical Informatics |
WOS Subject | Computer Science, Interdisciplinary Applications ; Medical Informatics |
WOS ID | WOS:000393191700319 |
Scopus ID | 2-s2.0-85013345417 |
Citation statistics | |
Document Type | Conference paper |
Identifier | http://repository.uic.edu.cn/handle/39GCC9TT/9049 |
Collection | Research outside affiliated institution |
Corresponding Author | Li, Shuaicheng |
Affiliation | Department of Computer Science,City University of Hong Kong,Hong Kong |
Recommended Citation GB/T 7714 | Zhang, Lu,Chen, Jiaxing,Li, Shuaicheng. More accurate models for detecting gene-gene interactions from public expression compendia[C], 2017: 1871-1878. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment