Deep learning models for modeling cellular transcription systems

作者: Lujia Chen

DOI:

关键词:

摘要: Cellular signal transduction system (CSTS) plays a fundamental role in maintaining homeostasis of cell by detecting changes its environment and orchestrates response. Perturbations CSTS lead to diseases such as cancers. Almost all CSTSs are involved regulating the expression certain genes leading signature gene expression. Therefore, profile is readout state could be used infer CSTS. However, convoluted mixture responses active signaling pathways cells. Therefore it difficult find associated with an individual pathway. An efficient way de-convoluting signals embedded needed. At beginning thesis, we applied Pearson correlation coefficient analysis study cellular transduced from ceramide species (lipids) genes. We found significant correlations between specific or groups showed that various dihydroceramide families regulated distinct subsets target predicted participate biologic processes. it’s well known pathway structure hierarchical. Useful information may not fully detected if only linear models More complex non-linear needed represent hierarchical This motivated us investigate contemporary deep learning (DLMs). Later, learn distributed representation statistical structures transcriptomic data. The organization machinery. Besides, they provide abstract data flexibility different degrees granularity. were capable biologically sensible representations (e.g., hidden units first layer transcription factors) revealing novel insights regarding machinery also model outperformed state-of-the-art methods Elastic-Net Linear Regression, Support Vector Machine Non-Negative Matrix Factorization.

参考文章(114)
Kahn Rhrissorrakrai, Vincenzo Belcastro, Erhan Bilal, Raquel Norel, Carine Poussin, Carole Mathis, Rémi H. J. Dulize, Nikolai V. Ivanov, Leonidas Alexopoulos, J. Jeremy Rice, Manuel C. Peitsch, Gustavo Stolovitzky, Pablo Meyer, Julia Hoeng, Understanding the limits of animal models as predictors of human biology: lessons learned from the sbv IMPROVER Species Translation Challenge Bioinformatics. ,vol. 31, pp. 471- 483 ,(2015) , 10.1093/BIOINFORMATICS/BTU611
Chaitanya Ekanadham, Andrew Y. Ng, Honglak Lee, Sparse deep belief net model for visual area V2 neural information processing systems. ,vol. 20, pp. 873- 880 ,(2007)
Robert Tibshirani, Regression Shrinkage and Selection Via the Lasso Journal of the Royal Statistical Society: Series B (Methodological). ,vol. 58, pp. 267- 288 ,(1996) , 10.1111/J.2517-6161.1996.TB02080.X
P. Tamayo, D. Slonim, J. Mesirov, Q. Zhu, S. Kitareewan, E. Dmitrovsky, E. S. Lander, T. R. Golub, Interpreting patterns of gene expression with self-organizing maps: Methods and application to hematopoietic differentiation Proceedings of the National Academy of Sciences of the United States of America. ,vol. 96, pp. 2907- 2912 ,(1999) , 10.1073/PNAS.96.6.2907
Esti Yeger-Lotem, Laura Riva, Linhui Julie Su, Aaron D Gitler, Anil G Cashikar, Oliver D King, Pavan K Auluck, Melissa L Geddie, Julie S Valastyan, David R Karger, Susan Lindquist, Ernest Fraenkel, Bridging high-throughput genetic and transcriptional data reveals cellular responses to alpha-synuclein toxicity Nature Genetics. ,vol. 41, pp. 316- 323 ,(2009) , 10.1038/NG.337
J.-P. Brunet, P. Tamayo, T. R. Golub, J. P. Mesirov, Metagenes and molecular pattern discovery using matrix factorization. Proceedings of the National Academy of Sciences of the United States of America. ,vol. 101, pp. 4164- 4169 ,(2004) , 10.1073/PNAS.0308531101
Yoshua Bengio, Pierre-Antoine Manzagol, Samy Bengio, Dumitru Erhan, Aaron Courville, Pascal Vincent, Why Does Unsupervised Pre-training Help Deep Learning? Journal of Machine Learning Research. ,vol. 11, pp. 625- 660 ,(2010)