FIDDLE: An integrative deep learning framework for functional genomic data inference

Authors: Umut Eser, L. Stirling Churchman

DOI: 10.1101/081380

Abstract: Numerous advances in sequencing technologies have revolutionized genomics by generating many types of functional genomic data. Statistical tools have been developed to analyze individual data types, but strategies to integrate disparate datasets under a unified framework are lacking. Moreover, most analysis techniques rely heavily on feature selection and data preprocessing, which increases the difficulty of addressing biological questions through the integration of multiple datasets. Here, we introduce FIDDLE (Flexible Integration of Data with Deep LEarning), an open-source, data-agnostic, flexible integrative framework that learns a representation from multiple data types to infer another data type. As a case study, we use multiple Saccharomyces cerevisiae genomic datasets to predict global transcription start sites (TSS) through the simulation of TSS-seq data. We demonstrate that a data type can be inferred from other data sources without manually specifying the relevant features or preprocessing. We show that models built from multiple genome-wide datasets perform profoundly better than models built from individual datasets. Thus, FIDDLE learns the complex synergistic relationships within individual datasets and, importantly, across datasets.
