Performance of Deep and Shallow Neural Networks, the Universal Approximation Theorem, Activity Cliffs, and QSAR

作者: David A. Winkler , Tu C. Le

DOI: 10.1002/MINF.201600118

关键词: Universal approximation theoremAlmost surelyProperty (programming)Deep learningQuantitative structure–activity relationshipArtificial neural networkArtificial intelligenceComputer scienceImage (mathematics)Set (abstract data type)Machine learning

摘要: Neural networks have generated valuable Quantitative Structure-Activity/Property Relationships (QSAR/QSPR) models for a wide variety of small molecules and materials properties. They grown in sophistication many their initial problems been overcome by modern mathematical techniques. QSAR studies almost always used so-called "shallow" neural which there is single hidden layer between the input output layers. Recently, new potentially paradigm-shifting type network based on Deep Learning has appeared. learning methods impressive improvements image voice recognition, are now being applied to modelling. This paper describes differences approach deep shallow networks, compares abilities predict properties test sets 15 large drug data (the kaggle set), discusses results terms Universal Approximation theorem how DNN may ameliorate or remove troublesome "activity cliffs" sets.

参考文章(39)
Frank Burden, Dave Winkler, Bayesian regularization of neural networks. Methods of Molecular Biology. ,vol. 458, pp. 25- 44 ,(2008) , 10.1007/978-1-60327-101-1_3
Dagmar Stumpfe, Jürgen Bajorath, Exploring Activity Cliffs in Medicinal Chemistry Journal of Medicinal Chemistry. ,vol. 55, pp. 2932- 2942 ,(2012) , 10.1021/JM201706B
Takehiko Nakama, Comparisons of Single- and Multiple-Hidden-Layer Neural Networks Advances in Neural Networks – ISNN 2011. pp. 270- 279 ,(2011) , 10.1007/978-3-642-21105-8_32
Tu C. Le, David A. Winkler, A Bright Future for Evolutionary Methods in Drug Design. ChemMedChem. ,vol. 10, pp. 1296- 1300 ,(2015) , 10.1002/CMDC.201500161
Kurt Hornik, Approximation capabilities of multilayer feedforward networks Neural Networks. ,vol. 4, pp. 251- 257 ,(1991) , 10.1016/0893-6080(91)90009-T
Chi-Yu Shao, Sing-Zuo Chen, Bo-Han Su, Yufeng J. Tseng, Emilio Xavier Esposito, Anton J. Hopfinger, Dependence of QSAR models on the selection of trial descriptor sets: a demonstration using nanotoxicity endpoints of decorated nanotubes. Journal of Chemical Information and Modeling. ,vol. 53, pp. 142- 158 ,(2013) , 10.1021/CI3005308
Junshui Ma, Robert P. Sheridan, Andy Liaw, George E. Dahl, Vladimir Svetnik, Deep neural nets as a method for quantitative structure-activity relationships. Journal of Chemical Information and Modeling. ,vol. 55, pp. 263- 274 ,(2015) , 10.1021/CI500747N
Stuart A. Kauffman, William G. Macready, Search strategies for applied molecular evolution Journal of Theoretical Biology. ,vol. 173, pp. 427- 440 ,(1995) , 10.1006/JTBI.1995.0074
D.A. Winkler, Neural networks in ADME and toxicity prediction Drugs of The Future. ,vol. 29, pp. 1043- 1057 ,(2004) , 10.1358/DOF.2004.029.10.863395
Bin Chen, Robert P. Sheridan, Viktor Hornak, Johannes H. Voigt, Comparison of Random Forest and Pipeline Pilot Naive Bayes in Prospective QSAR Predictions Journal of Chemical Information and Modeling. ,vol. 52, pp. 792- 803 ,(2012) , 10.1021/CI200615H