Type (I, II) errors variable selection in quantitative structure activity relationships

作者: Nematollah Omidikia , Mohsen Kompany-Zareh

DOI: 10.1016/J.CHEMOLAB.2016.01.007

关键词:

摘要: Abstract Selection of valid and informative descriptors is one the most crucial steps in QSAR studies. In this contribution, type (I, II) errors variable selection proposed as a statistically based method for selecting descriptors. Developed strategy can be considered sophisticated combination jackknifing y-randomization. Type (I) error measure descriptors' importance, it provided using each descriptor. (II) chance correlation calculated Successive projection algorithm Gram–Schmidt orthogonalization were utilized pre-selection methods initial reduction collinear uninformative variables. Selwood data including 31 molecules 53 descriptors, anti-HIV 107 160, Flour 116 1268 research. Model parameters set before after confirm adequacy novel selection.

参考文章(48)
Viviana Consonni, Davide Ballabio, Roberto Todeschini, Comments on the Definition of the Q2 Parameter for QSAR Validation Journal of Chemical Information and Modeling. ,vol. 49, pp. 1669- 1678 ,(2009) , 10.1021/CI900115Y
Ronald Fisher, The Design of Experiments ,(1935)
Roberto Todeschini, Viviana Consonni, Handbook of Molecular Descriptors ,(2002)
M.C. Ortiz, L.A. Sarabia, I. García, D. Giménez, E. Meléndez, Capability of detection and three-way data Analytica Chimica Acta. ,vol. 559, pp. 124- 136 ,(2006) , 10.1016/J.ACA.2005.11.069
Andrew G Mercader, Pablo R Duchowicz, Francisco M Fernández, Eduardo A Castro, None, Advances in the replacement and enhanced replacement method in QSAR and QSPR theories. Journal of Chemical Information and Modeling. ,vol. 51, pp. 1575- 1581 ,(2011) , 10.1021/CI200079B
Saeed Bagheri, Nematollah Omidikia, Mohsen Kompany-Zareh, Unsupervised selection of informative descriptors in QSAR study of anti-HIV activities of HEPT derivatives Chemometrics and Intelligent Laboratory Systems. ,vol. 128, pp. 135- 143 ,(2013) , 10.1016/J.CHEMOLAB.2013.08.004
M Bélen Sanz, Luis A Sarabia, Ana Herrero, M Cruz Ortiz, None, Multivariate analytical sensitivity in the determination of selenium, copper, lead and cadmium by stripping voltammetry when using soft calibration Analytica Chimica Acta. ,vol. 489, pp. 85- 94 ,(2003) , 10.1016/S0003-2670(03)00663-9
Mojtaba Shamsipur, Vali Zare-Shahabadi, Bahram Hemmateenejad, Morteza Akhond, An efficient variable selection method based on the use of external memory in ant colony optimization. Application to QSAR/QSPR studies. Analytica Chimica Acta. ,vol. 646, pp. 39- 46 ,(2009) , 10.1016/J.ACA.2009.05.005