The effects of data sources, cohort selection, and outcome definition on a predictive model of risk of thirty-day hospital readmissions

作者: Colin Walsh , George Hripcsak

DOI: 10.1016/J.JBI.2014.08.006

关键词:

摘要: Display Omitted Hospitals seek to predict readmissions, but there are many proposed models.The study aims show the impact of varying factors model design on performance.Targeting reason for readmission improves discrimination. ROCs range from 0.68 0.92.Patient visit and laboratory results contribute most prediction.Performance is highly cohort-dependent. Comparing models across studies may be hard. BackgroundHospital risk prediction remains a motivated area investigation operations in light hospital readmissions reduction program through CMS. Multiple have been reported with variable discriminatory performances, it unclear how affect performance. ObjectivesTo effects three development based health record data: (1) (primary diagnosis); (2) available data types (e.g. history, results, etc); (3) cohort selection. MethodsRegularized regression (LASSO) generate predictions using prevalence sampling. Support Vector Machine (SVM) used comparison selection testing. Calibration by refitting outcome prevalence. ResultsPredicting multiple reasons resulted ROC areas ranging 0.92 congestive heart failure 0.71 syncope all-cause readmission. Visit history tests contributed predictive value; contributions varied diagnosis. Cohort definition affected performance both parametric nonparametric algorithms. Compared all patients, limiting patients whose index admission diagnoses matched decrease average 0.78 0.55 (difference 0.23, p value 0.01). plots demonstrate good calibration low mean squared error. ConclusionTargeting impacted In general, prediction; source had large performance, these difficulty comparing different modeling.

参考文章(51)
Matthew E. Lissauer, Jose J. Diaz, Mayur Narayan, Paulesh K. Shah, Nader N. Hanna, Surgical intensive care unit admission variables predict subsequent readmission. American Surgeon. ,vol. 79, pp. 583- 588 ,(2013) , 10.1177/000313481307900618
Danning He, Simon C Mathews, Anthony N Kalloo, Susan Hutfless, Mining high-dimensional administrative claims data to predict early hospital readmissions. Journal of the American Medical Informatics Association. ,vol. 21, pp. 272- 279 ,(2014) , 10.1136/AMIAJNL-2013-002151
Bradley Efron, Thomas J. DiCiccio, Bootstrap Confidence Intervals Statistical Science. ,vol. 11, pp. 189- 228 ,(1996) , 10.1214/SS/1032280214
Snehalata Dongre, Rushi Longadge, Class Imbalance Problem in Data Mining Review arXiv: Learning. ,(2013)
Gary M. Weiss, Mining with rarity ACM SIGKDD Explorations Newsletter. ,vol. 6, pp. 7- 19 ,(2004) , 10.1145/1007730.1007734
Nyla Hazratjee, Markus Agito, Rocio Lopez, Bret Lashner, Maged K Rizk, Hospital Readmissions in Patients With Inflammatory Bowel Disease The American Journal of Gastroenterology. ,vol. 108, pp. 1024- 1032 ,(2013) , 10.1038/AJG.2012.343
Pedro Almagro, Bienvenido Barreiro, Anna Ochoa de Echagüen, Salvador Quintana, Mónica Rodríguez Carballeira, José L. Heredia, Javier Garau, Risk Factors for Hospital Readmission in Patients with Chronic Obstructive Pulmonary Disease Respiration. ,vol. 73, pp. 311- 317 ,(2006) , 10.1159/000088092
Gerard Salton, Christopher Buckley, Term Weighting Approaches in Automatic Text Retrieval Information Processing and Management. ,vol. 24, pp. 323- 328 ,(1988) , 10.1016/0306-4573(88)90021-0
Aksharananda Rambachan, Timothy R. Smith, Sujata Saha, Mark K. Eskandari, Bernard R. Bendok, John Y.S. Kim, Reasons for Readmission After Carotid Endarterectomy World Neurosurgery. ,vol. 82, ,(2014) , 10.1016/J.WNEU.2013.08.020
Robert Burns, Linda Olivia Nichols, Factors predicting readmission of older general medicine patients. Journal of General Internal Medicine. ,vol. 6, pp. 389- 393 ,(1991) , 10.1007/BF02598158