Robust Model Selection with LARS Based on S-estimators

作者: Claudio Agostinelli , Matias Salibian-Barrera

DOI: 10.1007/978-3-7908-2604-3_6

关键词: Sample size determinationOutlierRobust regressionCovariateComputer scienceEstimatorRobustness (computer science)EconometricsModel selectionStatisticsLeast-angle regression

摘要: We consider the problem of selecting a parsimonious subset explanatory variables from potentially large collection covariates. are concerned with case when data quality may be unreliable (e.g. there might outliers among observations). When number available covariates is moderately large, fitting all possible subsets not feasible option. Sequential methods like forward or backward selection generally “greedy” and fail to include important predictors these correlated. To avoid this Efron et al. (2004) proposed Least Angle Regression algorithm produce an ordered list (sequencing) according their relevance. introduce outlier robust versions LARS based on S-estimators for regression (Rousseeuw Yohai (1984)). This computationally efficient suitable even exceeds sample size. Simulation studies show that it also presence in compares favourably previous proposals literature.

参考文章(28)
P. Rousseeuw, V. Yohai, ROBUST REGRESSION BY MEANS OF S-ESTIMATORS Springer, New York, NY. pp. 256- 272 ,(1984) , 10.1007/978-1-4615-7821-5_15
Elvezio Ronchetti, Robustness Aspects of Model Choice Statistica Sinica. ,vol. 7, pp. 327- 338 ,(1997)
Víctor J. Yohai, Ricardo A. Maronna, R. Douglas Martin, Robust Statistics: Theory and Methods ,(2006)
Robert Tibshirani, Trevor Hastie, Jerome H. Friedman, The Elements of Statistical Learning ,(2001)
S. Morgenthaler, R. E. Welsch, A. Zenide, Algorithms for Robust Model Selection in Linear Regression Theory and Applications of Recent Robust Methods. pp. 195- 206 ,(2004) , 10.1007/978-3-0348-7958-3_18
Alan J. Miller, Subset Selection in Regression ,(2002)
Matias Salibian-Barrera, Ruben H. Zamar, Bootstrapping robust estimates of regression Annals of Statistics. ,vol. 30, pp. 556- 582 ,(2002) , 10.1214/AOS/1021379865
Elvezio Ronchetti, Robert G. Staudte, A Robust Version of Mallows's C P Journal of the American Statistical Association. ,vol. 89, pp. 550- 559 ,(1994) , 10.1080/01621459.1994.10476780
Guoqi Qian, Hans R. Künsch, On model selection via stochastic complexity in robust linear regression Journal of Statistical Planning and Inference. ,vol. 75, pp. 91- 116 ,(1998) , 10.1016/S0378-3758(98)00138-4