Predicting tissue-specific protein functions using multi-part tensor decomposition

作者: Sameh K. Mohamed

DOI: 10.1016/J.INS.2019.08.061

关键词:

摘要: Abstract Proteins are complex molecules that play many critical functions in the human body. They expressed different tissues body where their vary depending on tissue they in. The disorder of protein interactome affects biological which results diseases. Therefore, understanding and assessing tissue-specific is essential for disease diagnostics therapeutics. However, it a hard task as requires laboratory experimentations resources expensive have limited scalability. Thus, multiple computational approaches were developed to predict functions. These managed provide predictions with high scalability efficiency. still suffer from rates false positives. In this work, we propose new method predicting using tensor factorisation multi-part embeddings. We model proteins, functions, corresponding tensor, apply learn scores all possible protein-function associations each studied tissues. then show by experimental evaluation our outperforms state-of-the-art models margin 33.3% 13% area under precision recall ROC curves respectively.

参考文章(48)
Volker Tresp, Hans-peter Kriegel, Maximilian Nickel, A Three-Way Model for Collective Learning on Multi-Relational Data international conference on machine learning. pp. 809- 816 ,(2011)
Teng-Yu Ji, Ting-Zhu Huang, Xi-Le Zhao, Tian-Hui Ma, Gang Liu, Tensor completion using total variation and low-rank matrix factorization Information Sciences. ,vol. 326, pp. 243- 257 ,(2016) , 10.1016/J.INS.2015.07.049
Volker Tresp, Evgeniy Gabrilovich, Maximilian Nickel, Kevin Murphy, A Review of Relational Machine Learning for Knowledge Graphs Proceedings of the IEEE. ,vol. 104, pp. 11- 33 ,(2016) , 10.1109/JPROC.2015.2483592
Yoshua Bengio, Xavier Glorot, Understanding the difficulty of training deep feedforward neural networks international conference on artificial intelligence and statistics. pp. 249- 256 ,(2010)
Edward M. Marcotte, Matteo Pellegrini, Michael J. Thompson, Todd O. Yeates, David Eisenberg, A combined algorithm for genome-wide prediction of protein function Nature. ,vol. 402, pp. 83- 86 ,(1999) , 10.1038/47048
Mei Lu, Xiang-Jun Zhao, Li Zhang, Fan-Zhang Li, Semi-supervised concept factorization for document clustering Information Sciences. ,vol. 331, pp. 86- 98 ,(2016) , 10.1016/J.INS.2015.10.038
Qiaozhu Mei, Meng Qu, Mingzhe Wang, Jian Tang, Ming Zhang, Jun Yan, LINE: Large-scale Information Network Embedding the web conference. pp. 1067- 1077 ,(2015) , 10.1145/2736277.2741093
Li Deng, Jianfeng Gao, Wen-tau Yih, Xiaodong He, Bishan Yang, Learning Multi-Relational Semantics Using Neural-Embedding Models. arXiv: Computation and Language. ,(2014)
Casey S Greene, Arjun Krishnan, Aaron K Wong, Emanuela Ricciotti, Rene A Zelaya, Daniel S Himmelstein, Ran Zhang, Boris M Hartmann, Elena Zaslavsky, Stuart C Sealfon, Daniel I Chasman, Garret A FitzGerald, Kara Dolinski, Tilo Grosser, Olga G Troyanskaya, Understanding multicellular function and disease with human tissue-specific networks Nature Genetics. ,vol. 47, pp. 569- 576 ,(2015) , 10.1038/NG.3259
David Warde-Farley, Sylva L. Donaldson, Ovi Comes, Khalid Zuberi, Rashad Badrawi, Pauline Chao, Max Franz, Chris Grouios, Farzana Kazi, Christian Tannus Lopes, Anson Maitland, Sara Mostafavi, Jason Montojo, Quentin Shao, George Wright, Gary D. Bader, Quaid Morris, The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function. Nucleic Acids Research. ,vol. 38, pp. 214- 220 ,(2010) , 10.1093/NAR/GKQ537