Sequential Bayesian Regression for Multiple Imputation and Conditional Editing

作者: Robin Angela Jeffries

DOI:

关键词:

摘要: Analysts faced with errors in data apply editing rules to fix erroneous data. These edits are deterministically assigned and may not be correct all cases. This dissertation presents a unified method multiply impute missing edit using sequence of Bayesian regression models. The techniques used an exact parallel for multiple imputation models presented allow different types subject several error mechanisms. is called Sequential Regression Multiple Imputation Conditional Editing (SyBRMICE) creates fully imputed edited sets. Desired analyses performed on each complete consistently set individually. Results from these combined the same combining imputation. resulting parameter estimates intervals will then correctly account incurred both processes. Development SyBRMICE was motivated by Project Connect (PC). 8 year longitudinal intervention study aiming reduce teen pregnancy STD rates select middle high schools Los Angeles area. Survey collected annually measure effectiveness interventions. A paper survey administered students as group classroom, student responses have five years. subset participated years repeated answers question student. Data found PC can categorized belonging one types. If variable such gender that should remain constant over time observed differ across surveys, this said inconsistent response. variable, age or ever having sexual intercourse, increase monotonically non-monotonic reporting pattern, monotonic Lastly if two more related variables give conflicting information, Models stochastically three presented. measures, monotone longitudinal, multivariate developed separately steps example larger unifying procedure. examples demonstrate flexibility customizability analysis consistent sets generated procedure compared results single deterministically-edited, complete-case set.

参考文章(42)
Thomas F. Petkunas, William E. Winkler, THE DISCRETE EDIT SYSTEM ,(1997)
Thomas N. Herzog, Fritz J. Scheuren, William E. Winkler, Automatic Editing and Imputation of Sample Survey Data Data Quality and Record Linkage Techniques. pp. 61- 80 ,(2007) , 10.1007/0-387-69505-2_7
Lisa R. Draper, William E. Winkler, APPLICATION OF THE SPEER EDIT SYSTEM ,(1997)
Jasmina L Vujic, Monte Carlo Sampling Methods Handbooks in Operations Research and Management Science. ,vol. 10, pp. 353- 425 ,(2003) , 10.1016/S0927-0507(03)10006-0
Ronald Christensen, Bayesian ideas and data analysis : an introduction for scientists and statisticians CRC Press, Taylor & Francis. ,(2011)
Adam Davey, Michael J. Shanahan, Joseph L. Schafer, Correcting for selective nonresponse in the National Longitudinal Survey of Youth using multiple imputation Journal of Human Resources. ,vol. 36, pp. 500- 519 ,(2001) , 10.2307/3069628
William A. Link, Mitchell J. Eaton, On thinning of chains in MCMC Methods in Ecology and Evolution. ,vol. 3, pp. 112- 115 ,(2012) , 10.1111/J.2041-210X.2011.00131.X
Patrick Royston, Ian White, Multiple Imputation by Chained Equations (MICE): Implementation inStata Journal of Statistical Software. ,vol. 45, pp. 1- 20 ,(2011) , 10.18637/JSS.V045.I04