A theoretical investigation of several model selection criteria for dimensionality reduction

作者: Shikui Tu , Lei Xu

DOI: 10.1016/J.PATREC.2012.01.010

关键词:

摘要: Based on the problem of determining hidden dimensionality (or number latent factors) Factor Analysis (FA) model, this paper provides a theoretic comparison several classical model selection criteria, including Akaike's Information Criterion (AIC), Bozdogan's Consistent (CAIC), Hannan-Quinn information criterion (HQC), Schwarz's Bayesian (BIC). We focus building up partial order relative underestimation tendency. The is shown to be AIC, HQC, BIC, and CAIC, indicating probabilities from small large. This indicates an performances great extent, because underestimations usually take major proportion wrong selections when sample size population signal-to-noise ratio (SNR, defined as smallest variance dimensions noise) decrease. Synthetic experiments by varying values SNR training N verify theoretical results.

参考文章(32)
H. L. Le Roy, L. Lecam, J. Neyman, Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability; Vol. IV Revue de l'Institut International de Statistique / Review of the International Statistical Institute. ,vol. 37, pp. 230- ,(1969) , 10.2307/1402306
Iain M. Johnstone, High dimensional statistical inference and random matrices Proceedings oh the International Congress of Mathematicians: Madrid, August 22-30,2006 : invited lectures, Vol. 1, 2006, ISBN 978-3-03719-022-7, págs. 307-333. pp. 307- 333 ,(2006) , 10.4171/022
Shikui Tu, Lei Xu, An investigation of several typical model selection criteria for detecting the number of signals Frontiers of Electrical and Electronic Engineering in China. ,vol. 6, pp. 245- 255 ,(2011) , 10.1007/S11460-011-0146-Y
E. J. Hannan, B. G. Quinn, The Determination of the Order of an Autoregression Journal of the Royal Statistical Society: Series B (Methodological). ,vol. 41, pp. 190- 195 ,(1979) , 10.1111/J.2517-6161.1979.TB01072.X
Iain M. Johnstone, On the distribution of the largest eigenvalue in principal components analysis Annals of Statistics. ,vol. 29, pp. 295- 327 ,(2001) , 10.1214/AOS/1009210544
Herman Rubin, T.W. Anderson, Statistical Inference in Factor Analysis Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, Volume 5: Contributions to Econometrics, Industrial Research, and Psychometry. ,vol. 5, pp. 111- 150 ,(1956)
Shikui Tu, Lei Xu, Theoretical Analysis and Comparison of Several Criteria on Linear Model Dimension Reduction international conference on independent component analysis and signal separation. pp. 154- 162 ,(2009) , 10.1007/978-3-642-00599-2_20
J.D. Tubbs, W.A. Coberly, D.M. Young, Linear dimension reduction and Bayes classification with unknown population parameters Pattern Recognition. ,vol. 15, pp. 167- 172 ,(1982) , 10.1016/0031-3203(82)90068-1