What's in a Note? Unpacking Predictive Value in Clinical Note Representations.

作者: Peter Szolovits , Tristan Naumann , Willie Boag , Dustin Doss

DOI:

关键词: MEDLINENarrativeRepresentation (mathematics)Computer scienceCode (semiotics)Data scienceInformation extractionTask (project management)UnpackingSimple (philosophy)

摘要: Electronic Health Records (EHRs) have seen a rapid increase in adoption during the last decade. The narrative prose contained clinical notes is unstructured and unlocking its full potential has proved challenging. Many studies incorporating applied simple information extraction models to build representations that enhance downstream prediction task, such as mortality or readmission. Improved predictive performance suggests "good" representation. However, these extrinsic evaluations are blind most of insight notes. In order better understand power expressive prose, we investigate both intrinsic methods for understanding several common note representations. To ensure replicability support modeling community, run all experiments on publicly-available data provide our code.

参考文章(16)
Omer Levy, Yoav Goldberg, Ido Dagan, Improving Distributional Similarity with Lessons Learned from Word Embeddings Transactions of the Association for Computational Linguistics. ,vol. 3, pp. 211- 225 ,(2015) , 10.1162/TACL_A_00134
Rimma Pivovarov, Adler J. Perotte, Edouard Grave, John Angiolillo, Chris H. Wiggins, Noémie Elhadad, Learning probabilistic phenotypes from heterogeneous EHR data Journal of Biomedical Informatics. ,vol. 58, pp. 156- 165 ,(2015) , 10.1016/J.JBI.2015.10.001
Raphael Cohen, Iddo Aviram, Michael Elhadad, Noémie Elhadad, Redundancy-Aware Topic Modeling for Patient Record Notes PLoS ONE. ,vol. 9, pp. e87555- 7 ,(2014) , 10.1371/JOURNAL.PONE.0087555
Marzyeh Ghassemi, Tristan Naumann, Finale Doshi-Velez, Nicole Brimmer, Rohit Joshi, Anna Rumshisky, Peter Szolovits, Unfolding physiological state: mortality modelling in intensive care units knowledge discovery and data mining. ,vol. 2014, pp. 75- 84 ,(2014) , 10.1145/2623330.2623742
Sepp Hochreiter, Jürgen Schmidhuber, Long short-term memory Neural Computation. ,vol. 9, pp. 1735- 1780 ,(1997) , 10.1162/NECO.1997.9.8.1735
B. Kosko, Bidirectional associative memories systems man and cybernetics. ,vol. 18, pp. 49- 60 ,(1988) , 10.1109/21.87054
Alon Halevy, Peter Norvig, Fernando Pereira, The Unreasonable Effectiveness of Data IEEE Intelligent Systems. ,vol. 24, pp. 8- 12 ,(2009) , 10.1109/MIS.2009.36
Karla L. Caballero Barajas, Ram Akella, Dynamically Modeling Patient's Health State from Electronic Medical Records: A Time Series Approach knowledge discovery and data mining. pp. 69- 78 ,(2015) , 10.1145/2783258.2783289
Michele Banko, Eric Brill, Scaling to very very large corpora for natural language disambiguation Proceedings of the 39th Annual Meeting on Association for Computational Linguistics - ACL '01. pp. 26- 33 ,(2001) , 10.3115/1073012.1073017
Li-Wei H. Lehman, Roger G. Mark, Mohammed Saeed, William J. Long, Joon Lee, Risk stratification of ICU patients using topic models inferred from unstructured progress notes. american medical informatics association annual symposium. ,vol. 2012, pp. 505- 511 ,(2012)