A comparison of various software tools for dealing with missing data via imputation

作者: José Cortiñas Abrahantes , Cristina Sotto , Geert Molenberghs , Geert Vromman , Bart Bierinckx

DOI: 10.1080/00949655.2010.498788

关键词:

摘要: In real-life situations, we often encounter data sets containing missing observations. Statistical methods that address missingness have been extensively studied in recent years. One of the more popular approaches involves imputation values prior to analysis, thereby rendering complete. Imputation broadly encompasses an entire scope techniques developed make inferences about incomplete data, ranging from very simple strategies (e.g. mean imputation) advanced require estimation, for instance, posterior distributions using Markov chain Monte Carlo methods. Additional complexity arises when number patterns increases and/or both categorical and continuous random variables are involved. Implementation routines, procedures, or packages capable generating imputations now widely available. We review some these context a motivating example, as well simulation study,...

参考文章(37)
Geert. Molenberghs, Michael G. Kenward, Missing Data in Clinical Studies ,(2007)
P J Diggle, M G Kenward, Informative dropout in longitudinal data analysis (with discussion) Journal of The Royal Statistical Society Series C-applied Statistics. ,vol. 43, pp. 49- 94 ,(1994)
E H S Ip, J Diebolt, Stochastic EM: method and application ,(1996)
Geert Molenberghs, Models for Discrete Longitudinal Data ,(2005)
Richard D. Gill, Mark J. van der Laan, James M. Robins, Coarsening at Random: Characterizations, Conjectures, Counter-Examples State of the Art in Survival Analysis, Springer Lecture Notes in Statistics. ,vol. 123, pp. 255- 294 ,(1997) , 10.1007/978-1-4684-6316-3_14
Richard A Olshen, Charles J Stone, Leo Breiman, Jerome H Friedman, Classification and regression trees ,(1983)
Joseph G Ibrahim, Ming-Hui Chen, Stuart R Lipsitz, Amy H Herring, Missing-Data Methods for Generalized Linear Models Journal of the American Statistical Association. ,vol. 100, pp. 332- 346 ,(2005) , 10.1198/016214504000001844
Juned Siddique, Ofer Harel, MIDAS: A SAS macro for multiple imputation using distance-aided selection of donors Journal of Statistical Software. ,vol. 29, pp. 1- 18 ,(2009) , 10.18637/JSS.V029.I09
Xiao-Li Meng, Missing Data: Dial M for ??? Journal of the American Statistical Association. ,vol. 95, pp. 1325- 1330 ,(2000) , 10.1080/01621459.2000.10474341