Machine Learning Estimation of Atom Condensed Fukui Functions.

作者: Qingyou Zhang , Fangfang Zheng , Tanfeng Zhao , Xiaohui Qu , João Aires-de-Sousa

DOI: 10.1002/MINF.201500113

关键词: Rank (linear algebra)Fukui functionKernel (statistics)Artificial intelligenceQuantum chemistryChemistryMachine learningOrganic moleculesAtomMoleculeQuantitative structure–activity relationship

摘要: To enable the fast estimation of atom condensed Fukui functions, machine learning algorithms were trained with databases DFT pre-calculated values for ca. 23,000 atoms in organic molecules. The problem was approached as ranking types Bradley-Terry (BT) model, and regression function. Random Forests (RF) to predict function, rank a molecule, classify high/low Atomic descriptors based on counts spheres around kernel atom. BT coefficients assigned enabled identification (93-94 % accuracy) highest function pairs same molecule differences ≥0.1. In whole molecules, top could be recognized 50 % cases and, average, about 3 4 shortlist 4. Regression RF yielded predictions test sets R(2) =0.68-0.69, improving ability molecule. Atom classification (as function) obtained sensitivity 55-61 % specificity 94-95 %.

参考文章(21)
Michael Gaus, Qiang Cui, Marcus Elstner, DFTB3: Extension of the self-consistent-charge density-functional tight-binding method (SCC-DFTB). Journal of Chemical Theory and Computation. ,vol. 7, pp. 931- 948 ,(2011) , 10.1021/CT100684S
Goedele Roos, Paul Geerlings, Joris Messens, Enzymatic catalysis: the emerging role of conceptual density functional theory. Journal of Physical Chemistry B. ,vol. 113, pp. 13465- 13475 ,(2009) , 10.1021/JP9034584
Qingyou Zhang, Fangfang Zheng, Rui Fartaria, Diogo A.R.S. Latino, Xiaohui Qu, Tiago Campos, Tanfeng Zhao, João Aires-de-Sousa, A QSPR approach for the fast estimation of DFT/NBO partial atomic charges☆ Chemometrics and Intelligent Laboratory Systems. ,vol. 134, pp. 158- 163 ,(2014) , 10.1016/J.CHEMOLAB.2014.03.011
S.R. Bahn, K.W. Jacobsen, An object-oriented scripting interface to a legacy electronic structure code computational science and engineering. ,vol. 4, pp. 56- 66 ,(2002) , 10.1109/5992.998641
Falgun Shah, Prasenjit Mukherjee, Jiri Gut, Jennifer Legac, Philip J. Rosenthal, Babu L. Tekwani, Mitchell A. Avery, Identification of novel malarial cysteine protease inhibitors using structure-based virtual screening of a focused cysteine protease inhibitor library. Journal of Chemical Information and Modeling. ,vol. 51, pp. 852- 864 ,(2011) , 10.1021/CI200029Y
Robert G. Parr, Weitao Yang, Density functional approach to the frontier-electron theory of chemical reactivity Journal of the American Chemical Society. ,vol. 106, pp. 4049- 4050 ,(1984) , 10.1021/JA00326A036
RALPH ALLAN BRADLEY, MILTON E. TERRY, RANK ANALYSIS OF INCOMPLETE BLOCK DESIGNS THE METHOD OF PAIRED COMPARISONS Biometrika. ,vol. 39, pp. 324- 345 ,(1952) , 10.1093/BIOMET/39.3-4.324
Michael W. Schmidt, Kim K. Baldridge, Jerry A. Boatz, Steven T. Elbert, Mark S. Gordon, Jan H. Jensen, Shiro Koseki, Nikita Matsunaga, Kiet A. Nguyen, Shujun Su, Theresa L. Windus, Michel Dupuis, John A. Montgomery, General atomic and molecular electronic structure system Journal of Computational Chemistry. ,vol. 14, pp. 1347- 1363 ,(1993) , 10.1002/JCC.540141112
Debasis Sengupta, Asit K. Chandra, Minh Tho Nguyen, Regioselectivity of oxetane formation in the photocycloaddition of lowest (3)(n,pi*) state of carbonyl compounds: Interpretation using local softness Journal of Organic Chemistry. ,vol. 62, pp. 6404- 6406 ,(1997) , 10.1021/JO970353P
John J. Irwin, Teague Sterling, Michael M. Mysinger, Erin S. Bolstad, Ryan G. Coleman, ZINC: A Free Tool to Discover Chemistry for Biology Journal of Chemical Information and Modeling. ,vol. 52, pp. 1757- 1768 ,(2012) , 10.1021/CI3001277