Cohort selection for clinical trials using multiple instance learning

作者: Hong-Jie Dai , Feng-Duo Wang , Chih-Wei Chen , Chu-Hsien Su , Chi-Shin Wu

DOI: 10.1016/J.JBI.2020.103438

关键词:

摘要: Abstract Identifying patients eligible for clinical trials using electronic health records (EHRs) is a challenging task usually requiring comprehensive analysis of information stored in multiple EHRs patient. The goal this study to investigate different methods and their effectiveness identifying that meet specific eligibility selection criteria based on patients’ longitudinal records. An unstructured dataset released by the n2c2 cohort track was used, each which included 2–5 manually annotated thirteen pre-defined criteria. Unlike other studies, we formulated problem as instance learning (MIL) compared performance with rule-based single instance-based classifiers. Our official best run achieved an average micro-F score 0.8765 ranked one top ten results track. Further experiments demonstrated MIL-based classifiers consistently yield better than single-instance counterparts require overall comprehension distributed among all patient’s EHRs. Rule-based approaches exhibited don’t consideration several factors across This patient can be MIL problem. exhibit supplement provide comparison approaches.

参考文章(29)
Chunhua Weng, Herbert S. Chase, Carol Friedman, Li Li, Chintan O. Patel, Comparing ICD9-encoded diagnoses and NLP-processed discharge summaries for clinical trials pre-screening: a case study. american medical informatics association annual symposium. ,vol. 2008, pp. 404- 408 ,(2008)
Peter A. Flach, Thomas Gärtner, Alex J. Smola, Adam Kowalczyk, Multi-Instance Kernels international conference on machine learning. pp. 179- 186 ,(2002)
Nils Weidmann, Eibe Frank, Bernhard Pfahringer, A two-level learning method for generalized multi-instance problems european conference on machine learning. pp. 468- 479 ,(2003) , 10.1007/978-3-540-39857-8_42
Lin Dong (M. Sc.), A Comparison of Multi-instance Learning Algorithms The University of Waikato. ,(2006)
Chaitanya Shivade, Preethi Raghavan, Eric Fosler-Lussier, Peter J Embi, Noemie Elhadad, Stephen B Johnson, Albert M Lai, A review of approaches to identifying patient phenotype cohorts using electronic health records Journal of the American Medical Informatics Association. ,vol. 21, pp. 221- 230 ,(2014) , 10.1136/AMIAJNL-2013-001935
Hong-Jie Dai, Shabbir Syed-Abdul, Chih-Wei Chen, Chieh-Chen Wu, Recognition and Evaluation of Clinical Section Headings in Clinical Documents Using Token-Based Formulation with Conditional Random Fields BioMed Research International. ,vol. 2015, pp. 873012- 873012 ,(2015) , 10.1155/2015/873012
Jiajun Wu, Yinan Yu, Chang Huang, Kai Yu, Deep multiple instance learning for image classification and auto-annotation computer vision and pattern recognition. pp. 3460- 3469 ,(2015) , 10.1109/CVPR.2015.7298968
Ishna Neamatullah, Margaret M Douglass, Li-wei H Lehman, Andrew Reisner, Mauricio Villarroel, William J Long, Peter Szolovits, George B Moody, Roger G Mark, Gari D Clifford, Automated de-identification of free-text medical records BMC Medical Informatics and Decision Making. ,vol. 8, pp. 32- 32 ,(2008) , 10.1186/1472-6947-8-32
Jaume Amores, Multiple instance classification: Review, taxonomy and comparative study Artificial Intelligence. ,vol. 201, pp. 81- 105 ,(2013) , 10.1016/J.ARTINT.2013.06.003
Lynne Penberthy, Richard Brown, Federico Puma, Bassam Dahman, Automated matching software for clinical trials eligibility: measuring efficiency and flexibility. Contemporary Clinical Trials. ,vol. 31, pp. 207- 217 ,(2010) , 10.1016/J.CCT.2010.03.005