Redundancy analysis for characterizing the correlation between groups of variables - Applied to molecular descriptors

作者: Kurt Varmuza , Peter Filzmoser , Bettina Liebmann , Matthias Dehmer

DOI: 10.1016/J.CHEMOLAB.2011.05.013

关键词:

摘要: Abstract Redundancy analysis (RA) estimates the extent of linear relationships between blocks variables that are given for a set objects (samples). RA has only rarely been used in chemometrics. Basic principles and limits discussed, is briefly compared with canonical correlation (CCA) partial least-squares (PLS2) regression. The significance redundancy index estimated by permutation tests. For PLS2, an determining similarity variable can be derived equivalent to measure correlation, CMC. applied 3708 molecular descriptors (created software Dragon) 6458 chemical structures (AMES database). 27 descriptor groups characterized their indices, which allow comparison multivariate information content. results guide selection most different groups, perform better discrimination task (classification mutagenicity) than entire groups.

参考文章(52)
V. Consonni, D. Ballabio, A. Manganaro, A. Mauri, R. Todeschini, Canonical Measure of Correlation (CMC) and Canonical Measure of Distance (CMD) between sets of data: Part 2. Variable reduction Analytica Chimica Acta. ,vol. 648, pp. 52- 59 ,(2009) , 10.1016/J.ACA.2009.06.035
Johann Gasteiger, Thomas Engel, Chemoinformatics: A Textbook ,(2003)
Roberto Todeschini, Viviana Consonni, Handbook of Molecular Descriptors ,(2002)
M. R. Oliveira, J. A. Branco, C. Croux, P. Filzmoser, Robust Redundancy Analysis by Alternating Regression Birkhäuser, Basel. pp. 235- 246 ,(2004) , 10.1007/978-3-0348-7958-3_21
Richard G. Brereton, Antonio R. Carvalho, Mohammad Wasim, Yun Xu, Lifeng Zhu, Simeone Zomer, Handbook of chemoinformatics: from data to knowledge, edited by Johann Gasteiger, Volumes 1–4. Wiley‐VCH, Weinheim, 2003, ISBN 3527306803, €485 Journal of Chemometrics. ,vol. 18, pp. 265- 271 ,(2004) , 10.1002/CEM.866
Agnar Höskuldsson, PLS regression methods Journal of Chemometrics. ,vol. 2, pp. 211- 228 ,(1988) , 10.1002/CEM.1180020306
Stéphanie Dronnet, Caroline Lohou, Jean-Philippe Christides, Anne-Geneviève Bagnères, Cuticular Hydrocarbon Composition Reflects Genetic Relationship Among Colonies of the Introduced Termite Reticulitermes santonensis Feytaud Journal of Chemical Ecology. ,vol. 32, pp. 1027- 1042 ,(2006) , 10.1007/S10886-006-9043-X
R. Penrose, A Generalized inverse for matrices Mathematical Proceedings of the Cambridge Philosophical Society. ,vol. 51, pp. 406- 413 ,(1955) , 10.1017/S0305004100030401