SimBoost: a read-across approach for predicting drug-target binding affinities using gradient boosting machines

作者: Tong He , Marten Heidemeyer , Fuqiang Ban , Artem Cherkasov , Martin Ester

DOI: 10.1186/S13321-017-0209-Z

关键词:

摘要: Computational prediction of the interaction between drugs and targets is a standing challenge in field drug discovery. A number rather accurate predictions were reported for various binary drug–target benchmark datasets. However, notable drawback representation data that missing endpoints non-interacting pairs are not differentiated from inactive cases, predicted levels activity depend on pre-defined binarization thresholds. In this paper, we present method called SimBoost predicts continuous (non-binary) values binding affinities compounds proteins thus incorporates whole spectrum true negative to positive interactions. Additionally, propose version SimBoostQuant which computes interval order assess confidence affinity, defining Applicability Domain metrics explicitly. We evaluate two established datasets one new dataset use as read-across cheminformatics applications. demonstrate our methods outperform previously models across studied

参考文章(48)
Nicolai Meinshausen, Quantile Regression Forests Journal of Machine Learning Research. ,vol. 7, pp. 983- 999 ,(2006)
Jerome H. Friedman, Greedy function approximation: A gradient boosting machine. Annals of Statistics. ,vol. 29, pp. 1189- 1232 ,(2001) , 10.1214/AOS/1013203451
Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, François Laviolette, Mario Marchand, Victor Lempitsky, Domain-Adversarial Training of Neural Networks Domain Adaptation in Computer Vision Applications. ,vol. 17, pp. 189- 209 ,(2017) , 10.1007/978-3-319-58347-1_10
Rajeev Motwani, Terry Winograd, Lawrence Page, Sergey Brin, The PageRank Citation Ranking : Bringing Order to the Web the web conference. ,vol. 98, pp. 161- 172 ,(1999)
Mark Newman, Networks: An Introduction ,(2010)
Yvonne Y. Li, Jianghong An, Steven J. M. Jones, A Computational Approach to Finding Novel Targets for Existing Drugs PLoS Computational Biology. ,vol. 7, pp. e1002139- ,(2011) , 10.1371/JOURNAL.PCBI.1002139
Vladimir Svetnik, Ting Wang, Christopher Tong, Andy Liaw, Robert P. Sheridan, Qinghua Song, Boosting: an ensemble learning tool for compound classification and QSAR modeling. Journal of Chemical Information and Modeling. ,vol. 45, pp. 786- 799 ,(2005) , 10.1021/CI0500379
Nobuyoshi Nagamine, Takayuki Shirakawa, Yusuke Minato, Kentaro Torii, Hiroki Kobayashi, Masaya Imoto, Yasubumi Sakakibara, Integrating Statistical Predictions and Experimental Verifications for Enhancing Protein-Chemical Interaction Predictions in Virtual Screening PLoS Computational Biology. ,vol. 5, pp. e1000397- ,(2009) , 10.1371/JOURNAL.PCBI.1000397
Alan C Cheng, Ryan G Coleman, Kathleen T Smyth, Qing Cao, Patricia Soulard, Daniel R Caffrey, Anna C Salzberg, Enoch S Huang, Structure-based maximal affinity model predicts small-molecule druggability. Nature Biotechnology. ,vol. 25, pp. 71- 75 ,(2007) , 10.1038/NBT1273