Influence of single observations on the choice of the penalty parameter in ridge regression

作者: Kristoffer H. Hellton , Riccardo De Bin , Camilla Lingjærde

DOI:

关键词:

摘要: Penalized regression methods, such as ridge regression, heavily rely on the choice of a tuning, or penalty, parameter, which is often computed via cross-validation. Discrepancies in value penalty parameter may lead to substantial differences coefficient estimates and predictions. In this paper, we investigate effect single observations optimal tuning showing how presence influential points can dramatically change it. We distinguish between "expanders" "shrinkers", based their model complexity. Our approach supplies visual exploratory tool identify points, naturally implementable for high-dimensional data where traditional approaches usually fail. Applications real examples, both low- high-dimensional, simulation study are presented.

参考文章(22)
Roger W. Johnson, Fitting Percentage of Body Fat to Simple Body Measurements Journal of Statistics Education. ,vol. 4, ,(1996) , 10.1080/10691898.1996.11910505
Gene H. Golub, Michael Heath, Grace Wahba, Generalized Cross-Validation as a Method for Choosing a Good Ridge Parameter Technometrics. ,vol. 21, pp. 215- 223 ,(1979) , 10.1080/00401706.1979.10489751
Lei Shi, Xueren Wang, Local influence in ridge regression Computational Statistics & Data Analysis. ,vol. 31, pp. 341- 353 ,(1999) , 10.1016/S0167-9473(99)00019-5
Sylvia Moeckel, Katharina Meyer, Petra Leukel, Fabian Heudorfer, Corinna Seliger, Christina Stangl, Ulrich Bogdahn, Martin Proescholdt, Alexander Brawanski, Arabel Vollmann-Zwerenz, Markus J. Riemenschneider, Anja-Katrin Bosserhoff, Rainer Spang, Peter Hau, Response-Predictive Gene Expression Profiling of Glioma Progenitor Cells In Vitro PLoS ONE. ,vol. 9, pp. e108632- ,(2014) , 10.1371/JOURNAL.PONE.0108632
Anthony C Atkinson, Marco Riani, Forward search added-variable t-tests and the effect of masked outliers on model selection Biometrika. ,vol. 89, pp. 939- 946 ,(2002) , 10.1093/BIOMET/89.4.939
Daniel Peña, A New Statistic for Influence in Linear Regression Technometrics. ,vol. 47, pp. 1- 12 ,(2005) , 10.1198/004017004000000662
Arthur E. Hoerl, Robert W. Kennard, Ridge regression: biased estimation for nonorthogonal problems Technometrics. ,vol. 42, pp. 80- 86 ,(2000) , 10.2307/1271436
Mandakini G Patel, The effect of dietary intervention on weight gains after renal transplantation Journal of Renal Nutrition. ,vol. 8, pp. 137- 141 ,(1998) , 10.1016/S1051-2276(98)90005-X
Ann Cashion, Ansley Stanfill, Fridtjof Thomas, Lijing Xu, Thomas Sutter, James Eason, Mang Ensell, Ramin Homayouni, Expression Levels of Obesity-Related Genes Are Associated with Weight Change in Kidney Transplant Recipients PLoS ONE. ,vol. 8, pp. e59962- ,(2013) , 10.1371/JOURNAL.PONE.0059962