Averaging causal estimators in high dimensions

作者: Matthew Cefalu , Joseph Antonelli

DOI:

关键词: EconometricsCorrelation and dependenceEstimatorConfoundingSet (psychology)Coupling (probability)Computer scienceRobustness (computer science)Estimation

摘要: There has been increasing interest in recent years the development of approaches to estimate causal effects when number potential confounders is prohibitively large. This growth led a estimators one could use this setting. Each these different operating characteristics, and it unlikely that estimator will outperform all others across possible scenarios. Coupling with fact an analyst can never know which approach best for their particular data, we propose synthetic averages over set candidate estimators. Averaging widely used statistics problems such as prediction, where there are many models, averaging improve performance increase robustness using incorrect models. We show ideas carry into estimation high-dimensional theoretically provides against choosing bad model, empirically via simulation performs quite well, most cases nearly well among Finally, illustrate environmental wide association study see largest benefit more difficult scenarios have large numbers confounders.

参考文章(34)
David Draper, Assessment and Propagation of Model Uncertainty Journal of the Royal Statistical Society: Series B (Methodological). ,vol. 57, pp. 45- 70 ,(1995) , 10.1111/J.2517-6161.1995.TB02015.X
Ander Wilson, Brian J. Reich, Confounder selection via penalized credible regions. Biometrics. ,vol. 70, pp. 852- 861 ,(2014) , 10.1111/BIOM.12203
Chris T. Volinsky, Adrian E. Raftery, David Madigan, Jennifer A. Hoeting, Bayesian model averaging: a tutorial (with comments by M. Clyde, David Draper and E. I. George, and a rejoinder by the authors Statistical Science. ,vol. 14, pp. 382- 417 ,(1999) , 10.1214/SS/1009212519
Paul R. Rosenbaum, Reducing Bias in Observational Studies Using Subclassification on the Propensity Score Journal of the American Statistical Association. ,vol. 79, pp. 516- 524 ,(1984) , 10.2307/2288398
Neal Thomas, Donald B. Rubin, Matching Using Estimated Propensity Scores: Relating Theory to Practice Matched Sampling for Causal Effects. ,vol. 52, pp. 263- 281 ,(1996) , 10.1017/CBO9780511810725.024
M. Ghosh, J. N. K. Rao, Small Area Estimation: An Appraisal Statistical Science. ,vol. 9, pp. 55- 76 ,(1994) , 10.1214/SS/1177010647
Hui Zou, The adaptive lasso and its oracle properties Journal of the American Statistical Association. ,vol. 101, pp. 1418- 1429 ,(2006) , 10.1198/016214506000000735
Mark J. van der Laan, Daniel Rubin, Targeted Maximum Likelihood Learning The International Journal of Biostatistics. ,vol. 2, pp. 1- 40 ,(2006) , 10.2202/1557-4679.1043
Ron C. Mittelhammer, George G. Judge, Combining estimators to improve structural model estimation and inference under quadratic loss Journal of Econometrics. ,vol. 128, pp. 1- 29 ,(2005) , 10.1016/J.JECONOM.2004.08.006
A. Chatterjee, S. N. Lahiri, Asymptotic properties of the residual bootstrap for Lasso estimators Proceedings of the American Mathematical Society. ,vol. 138, pp. 4497- 4509 ,(2010) , 10.1090/S0002-9939-2010-10474-4