Machine learning model combining features from algorithms with different analytical methodologies to detect laboratory-event-related adverse drug reaction signals

作者: Eugene Jeong , Namgi Park , Young Choi , Rae Woong Park , Dukyong Yoon

DOI: 10.1371/JOURNAL.PONE.0207749

关键词: Adverse drug reactionMachine learningSensitivity (control systems)Artificial intelligenceRandom forestElectronic health recordAlgorithmEvent (probability theory)Support vector machineSignal processingReceiver operating characteristicComputer scienceGeneral Biochemistry, Genetics and Molecular BiologyGeneral Agricultural and Biological SciencesGeneral Medicine

摘要: Background The importance of identifying and evaluating adverse drug reactions (ADRs) has been widely recognized. Many studies have developed algorithms for ADR signal detection using electronic health record (EHR) data. In this study, we propose a machine learning (ML) model that enables accurate by integrating features from existing based on inpatient EHR laboratory results. Materials methods To construct an reference dataset, extracted known drug–laboratory event pairs represented test the EU-SPC SIDER databases. All possible pairs, except ones, are considered unknown. To detect pair, three algorithms—CERT, CLEAR, PACE—were applied to 21-year We also constructed ML models (based random forest, L1 regularized logistic regression, support vector machine, neural network) use intermediate products CERT, PACE as inputs determine whether pair is associated. For performance comparison, evaluated sensitivity, specificity, positive predictive value (PPV), negative (NPV), F1-measure, area under receiver operating characteristic (AUROC). Results All measures outperformed those with sensitivity 0.593–0.793, specificity 0.619–0.796, NPV 0.645–0.727, PPV 0.680–0.777, F1-measure 0.629–0.709, AUROC 0.737–0.816. Features related change or distribution shape were important detecting signals. Conclusions Improved indicated applying our data feasible promising more comprehensive signals.

参考文章(26)
Michael Kuhn, Ivica Letunic, Lars Juhl Jensen, Peer Bork, The SIDER database of drugs and side effects Nucleic Acids Research. ,vol. 44, pp. 1075- 1079 ,(2016) , 10.1093/NAR/GKV1075
Dukyong Yoon, Martijn J. Schuemie, Ju Han Kim, Dong Ki Kim, Man Young Park, Eun Kyoung Ahn, Eun-Young Jung, Dong Kyun Park, Soo Yeon Cho, Dahye Shin, Yeonsoo Hwang, Rae Woong Park, A normalization method for combination of laboratory test results from different electronic healthcare databases in a distributed research network. Pharmacoepidemiology and Drug Safety. ,vol. 25, pp. 307- 316 ,(2016) , 10.1002/PDS.3893
Jing Zhao, Aron Henriksson, Lars Asker, Henrik Boström, Predictive modeling of structured electronic health records for adverse drug event detection bioinformatics and biomedicine. ,vol. 15, pp. 1- 15 ,(2015) , 10.1186/1472-6947-15-S4-S1
Dukyong Yoon, Eun Kyoung Ahn, Man Young Park, Soo Yeon Cho, Patrick Ryan, Martijn J. Schuemie, Dahye Shin, Hojun Park, Rae Woong Park, Conversion and Data Quality Assessment of Electronic Health Record Data at a Korean Tertiary Teaching Hospital to a Common Data Model for Distributed Network Research Healthcare Informatics Research. ,vol. 22, pp. 54- 58 ,(2016) , 10.4258/HIR.2016.22.1.54
Abhyuday N Jagannatha, Hong Yu, Bidirectional RNN for Medical Event Detection in Electronic Health Records north american chapter of the association for computational linguistics. ,vol. 2016, pp. 473- 482 ,(2016) , 10.18653/V1/N16-1056
Jing Zhao, Aron Henriksson, Learning temporal weights of clinical events using variable importance bioinformatics and biomedicine. ,vol. 16, pp. 71- ,(2016) , 10.1186/S12911-016-0311-6
Alexandra C. Pacurariu, Preciosa M. Coloma, Liana Gross-Martirosyan, Miriam C. Sturkenboom, Sabine M. Straus, Decision making in drug safety-a literature review of criteria used to prioritize newly detected safety issues. Pharmacoepidemiology and Drug Safety. ,vol. 26, pp. 327- 334 ,(2017) , 10.1002/PDS.4128
Suehyun Lee, Jiyeob Choi, Hun-Sung Kim, Grace Juyun Kim, Kye Hwa Lee, Chan Hee Park, Jongsoo Han, Dukyong Yoon, Man Young Park, Rae Woong Park, Hye-Ryun Kang, Ju Han Kim, Standard-based comprehensive detection of adverse drug reaction signals from nursing statements and laboratory results in electronic health records. Journal of the American Medical Informatics Association. ,vol. 24, pp. 697- 708 ,(2017) , 10.1093/JAMIA/OCW168
Tamara P. Miller, Yimei Li, Kelly D. Getz, Jesse Dudley, Evanette Burrows, Jeffrey Pennington, Azada Ibrahimova, Brian T. Fisher, Rochelle Bagatell, Alix E. Seif, Robert Grundmeier, Richard Aplenc, Using electronic medical record data to report laboratory adverse events. British Journal of Haematology. ,vol. 177, pp. 283- 286 ,(2017) , 10.1111/BJH.14538