Extended Bayesian information criteria for model selection with large model spaces

作者: J. Chen , Z. Chen

DOI: 10.1093/BIOMET/ASN034

关键词:

摘要: SUMMARY The ordinary Bayesian information criterion is too liberal for model selection when the space large. In this paper, we re-examine paradigm and propose an extended family of criteria, which take into account both number unknown parameters complexity space. Their consistency established, in particular allowing covariates to increase infinity with sample size. performance various situations evaluated by simulation studies. It demonstrated that criteria incur a small loss positive rate but tightly control false discovery rate, desirable property many applications. are extremely useful variable problems moderate size huge covariates, especially genome-wide association studies, now active area genetics research.

参考文章(25)
M. Stone, Cross-Validatory Choice and Assessment of Statistical Predictions (With Discussion) Journal of the Royal Statistical Society: Series B (Methodological). ,vol. 38, pp. 102- 102 ,(1976) , 10.1111/J.2517-6161.1976.TB01573.X
A. I. McLeod, Ying Zhang, Improved Subset Autoregression: With R Package Journal of Statistical Software. ,vol. 28, pp. 1- 28 ,(2008) , 10.18637/JSS.V028.I02
James O Berger, Luis R Pericchi, JK Ghosh, Tapas Samanta, Fulvio De Santis, JO Berger, LR Pericchi, Objective Bayesian Methods for Model Selection: Introduction and Comparison Institute of Mathematical Statistics. pp. 135- 207 ,(2001) , 10.1214/LNMS/1215540968
M. Csörgö, Lajos Horváth, Limit theorems in change-point analysis Wiley. ,(1997)
Jonathan Marchini, Peter Donnelly, Lon R Cardon, None, Genome-wide strategies for detecting multiple loci that influence complex diseases Nature Genetics. ,vol. 37, pp. 413- 417 ,(2005) , 10.1038/NG1537
RADHAKRISHNA RAO, YUEHUA WU, A strongly consistent procedure for model selection in a regression problem Biometrika. ,vol. 76, pp. 369- 374 ,(1989) , 10.1093/BIOMET/76.2.369
Nicolai Meinshausen, Peter Bühlmann, High-dimensional graphs and variable selection with the Lasso Annals of Statistics. ,vol. 34, pp. 1436- 1462 ,(2006) , 10.1214/009053606000000281
S. S. Wilks, The Large-Sample Distribution of the Likelihood Ratio for Testing Composite Hypotheses Annals of Mathematical Statistics. ,vol. 9, pp. 60- 62 ,(1938) , 10.1214/AOMS/1177732360