An investigation of several typical model selection criteria for detecting the number of signals

作者: Shikui Tu , Lei Xu

DOI: 10.1007/S11460-011-0146-Y

关键词: Artificial intelligenceModel selectionPrincipal component analysisMinimaxStatistical hypothesis testingBayesian information criterionBayesian probabilityAkaike information criterionMathematicsPattern recognitionSample size determination

摘要: Based on the problem of detecting number signals, this paper provides a systematic empirical investigation model selection performances several classical criteria and recently developed methods (including Akaike’s information criterion (AIC), Schwarz’s Bayesian criterion, Bozdogan’s consistent AIC, Hannan-Quinn Minka’s (MK) principal component analysis (PCA) Kritchman & Nadler’s hypothesis tests (KN), Perry Wolfe’s minimax rank estimation thresholding algorithm (MM), Ying-Yang (BYY) harmony learning), by varying signal-to-noise ratio (SNR) training sample size N. A family indifference curves is defined contour lines accuracies, such that we can examine joint effect N SNR rather than merely either with other fixed as usually done in literature. The visually reveal all demonstrate relative advantages obviously within region moderate SNR. Moreover, importance studying also confirmed an alternative reference maximizing testing likelihood. It has been shown via extensive simulations AIC BYY learning, well MK, KN, MM, are relatively more robust others against decreasing SNR, superior for small size.

参考文章(35)
H. L. Le Roy, L. Lecam, J. Neyman, Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability; Vol. IV Revue de l'Institut International de Statistique / Review of the International Statistical Institute. ,vol. 37, pp. 230- ,(1969) , 10.2307/1402306
Iain M. Johnstone, High dimensional statistical inference and random matrices Proceedings oh the International Congress of Mathematicians: Madrid, August 22-30,2006 : invited lectures, Vol. 1, 2006, ISBN 978-3-03719-022-7, págs. 307-333. pp. 307- 333 ,(2006) , 10.4171/022
E. J. Hannan, B. G. Quinn, The Determination of the Order of an Autoregression Journal of the Royal Statistical Society: Series B (Methodological). ,vol. 41, pp. 190- 195 ,(1979) , 10.1111/J.2517-6161.1979.TB01072.X
Iain M. Johnstone, On the distribution of the largest eigenvalue in principal components analysis Annals of Statistics. ,vol. 29, pp. 295- 327 ,(2001) , 10.1214/AOS/1009210544
Herman Rubin, T.W. Anderson, Statistical Inference in Factor Analysis Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, Volume 5: Contributions to Econometrics, Industrial Research, and Psychometry. ,vol. 5, pp. 111- 150 ,(1956)
Lei Xu, Bayesian Ying-Yang system, best harmony learning, and five action circling Frontiers of Electrical and Electronic Engineering in China. ,vol. 5, pp. 281- 328 ,(2010) , 10.1007/S11460-010-0108-9
Shikui Tu, Lei Xu, Theoretical Analysis and Comparison of Several Criteria on Linear Model Dimension Reduction international conference on independent component analysis and signal separation. pp. 154- 162 ,(2009) , 10.1007/978-3-642-00599-2_20
Shikui Tu, Lei Xu, Parameterizations make different model selections: Empirical findings from factor analysis Frontiers of Electrical and Electronic Engineering in China. ,vol. 6, pp. 256- 274 ,(2011) , 10.1007/S11460-011-0150-2