Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities

作者: Jure Leskovec , Francis Nguyen , Francis Nguyen , Anna Goldenberg , Michael M. Hoffman

DOI: 10.1016/J.INFFUS.2018.09.012

关键词:

摘要: New technologies have enabled the investigation of biology and human health at an unprecedented scale in multiple dimensions. These dimensions include a myriad properties describing genome, epigenome, transcriptome, microbiome, phenotype, lifestyle. No single data type, however, can capture complexity all factors relevant to understanding phenomenon such as disease. Integrative methods that combine from thus emerged critical statistical computational approaches. The key challenge developing approaches is identification effective models provide comprehensive systems view. An ideal method answer biological or medical question, identifying important features predicting outcomes, by harnessing heterogeneous across several variation. In this Review, we describe principles integration discuss current available implementations. We examples successful medicine. Finally, challenges biomedical integrative our perspective on future development field.

参考文章(343)
Robert Sorrentino, Jianying Hu, Fei Wang, Ping Zhang, Exploring the relationship between drug side-effects and therapeutic indications. american medical informatics association annual symposium. ,vol. 2013, pp. 1568- 1577 ,(2013)
Nobuo Ogawa, Mark D. Biggin, High-throughput SELEX determination of DNA sequences bound by transcription factors in vitro. Methods of Molecular Biology. ,vol. 786, pp. 51- 63 ,(2012) , 10.1007/978-1-61779-292-2_3
Yizhou Sun, Jiawei Han, Xifeng Yan, Philip S. Yu, Tianyi Wu, PathSim Proceedings of the VLDB Endowment. ,vol. 4, pp. 992- 1003 ,(2011) , 10.14778/3402707.3402736
Ping Zhang, Pankaj Agarwal, Zoran Obradovic, Computational drug repositioning by ranking and integrating multiple data sources european conference on machine learning. pp. 579- 594 ,(2013) , 10.1007/978-3-642-40994-3_37
Andrew K. Rider, Nitesh V. Chawla, Scott J. Emrich, A Survey of Current Integrative Network Algorithms for Systems Biology Systems Biology. pp. 479- 495 ,(2013) , 10.1007/978-94-007-6803-1_17
Jean-Jack M. Riethoven, Regulatory Regions in DNA: Promoters, Enhancers, Silencers, and Insulators Methods in Molecular Biology. ,vol. 674, pp. 33- 42 ,(2010) , 10.1007/978-1-60761-854-6_3
Volker Tresp, Hans-peter Kriegel, Maximilian Nickel, A Three-Way Model for Collective Learning on Multi-Relational Data international conference on machine learning. pp. 809- 816 ,(2011)
D. A. Cusanovich, R. Daza, A. Adey, H. A. Pliner, L. Christiansen, K. L. Gunderson, F. J. Steemers, C. Trapnell, J. Shendure, Multiplex single cell profiling of chromatin accessibility by combinatorial cellular indexing. Science. ,vol. 348, pp. 910- 914 ,(2015) , 10.1126/SCIENCE.AAB1601
Vladimir Gligorijević, Noël Malod-Dognin, Nataša Pržulj, Fuse: multiple network alignment via data fusion. Bioinformatics. ,vol. 32, pp. 1195- 1203 ,(2016) , 10.1093/BIOINFORMATICS/BTV731