On the number of principal components: A test of dimensionality based on measurements of similarity between matrices

作者: Stéphane Dray

DOI: 10.1016/J.CSDA.2007.07.015

关键词:

摘要: An important problem in principal component analysis (PCA) is the estimation of correct number components to retain. PCA most often used reduce a set observed variables new lower dimensionality. The choice this dimensionality crucial step for interpretation results or subsequent analyses, because it could lead loss information (underestimation) introduction random noise (overestimation). New techniques are proposed evaluate PCA. They based on similarity measurements, singular value decomposition and permutation procedures. A simulation study conducted relative merits approaches. Results showed that one method RV coefficient very accurate seems be more efficient than other existing

参考文章(32)
P. Robert, Y. Escoufier, A Unifying Tool for Linear Multivariate Statistical Methods: The RV- Coefficient Applied Statistics. ,vol. 25, pp. 257- 265 ,(1976) , 10.2307/2347233
Jean-Paul Benzecri, STATISTICAL ANALYSIS AS A TOOL TO MAKE PATTERNS EMERGE FROM DATA Methodologies of Pattern Recognition. pp. 35- 74 ,(1969) , 10.1016/B978-1-4832-3093-1.50009-2
Jean Thioulouse, Daniel Chessel, Anne B Dufour, The ade4 package - I : One-table methods R News. ,vol. 4, pp. 5- 10 ,(2004)
Wolfgang Härdle, Léopold Simar, Computer Intensive Methods in Statistics ,(1994)
Louis Ferré, Selection of components in principal component analysis: a comparison of methods Computational Statistics & Data Analysis. ,vol. 19, pp. 669- 682 ,(1995) , 10.1016/0167-9473(94)00020-J
James C. Lingoes, Peter H. Schönemann, Alternative measures of fit for the Schönemann-carroll matrix fitting algorithm Psychometrika. ,vol. 39, pp. 423- 427 ,(1974) , 10.1007/BF02291666
Philippe Besse, PCA stability and choice of dimensionality Statistics & Probability Letters. ,vol. 13, pp. 405- 410 ,(1992) , 10.1016/0167-7152(92)90115-L