An Analysis of Attention over Clinical Notes for Predictive Tasks

作者: Sarthak Jain , Ramin Mohammadi , Byron C. Wallace

DOI: 10.18653/V1/W19-1902

关键词: InterpretabilityComputer scienceDomain (software engineering)Order (exchange)Key (cryptography)Data scienceNatural languageEncoderTransparency (graphic)Medical record

摘要: The shift to electronic medical records (EMRs) has engendered research into machine learning and natural language technologies analyze patient records, predict from these clinical outcomes of interest. Two observations motivate our aims here. First, unstructured notes contained within EMR often contain key information, hence should be exploited by models. Second, while strong predictive performance is important, interpretability models perhaps equally so for applications in this domain. Together, points suggest that neural may benefit incorporation attention over notes, which one hope will both yield gains afford transparency predictions. In work we perform experiments explore question using two corpora four different tasks, that: (i) inclusion mechanisms critical encoder modules operate fields order competitive performance, but, (ii) unfortunately, boost it decidedly less clear whether they provide meaningful support

参考文章(13)
Marzyeh Ghassemi, Tristan Naumann, Finale Doshi-Velez, Nicole Brimmer, Rohit Joshi, Anna Rumshisky, Peter Szolovits, Unfolding physiological state: mortality modelling in intensive care units knowledge discovery and data mining. ,vol. 2014, pp. 75- 84 ,(2014) , 10.1145/2623330.2623742
Charles Elkan, Zachary C. Lipton, David C. Kale, Randall Wetzel, Learning to Diagnose with LSTM Recurrent Neural Networks arXiv: Learning. ,(2015)
Alistair E.W. Johnson, Tom J. Pollard, Lu Shen, Li-wei H. Lehman, Mengling Feng, Mohammad Ghassemi, Benjamin Moody, Peter Szolovits, Leo Anthony Celi, Roger G. Mark, MIMIC-III, a freely accessible critical care database Scientific Data. ,vol. 3, pp. 160035- 160035 ,(2016) , 10.1038/SDATA.2016.35
F Ginter, S Pyysalo, T Salakoski, S Ananiadou, H Moen, Distributional Semantics Resources for Biomedical Text Processing In: Proceedings of LBM 2013; 2013. p. 39-44.. pp. 39- 44 ,(2013)
Will Monroe, Dan Jurafsky, Jiwei Li, Understanding Neural Networks through Representation Erasure arXiv: Computation and Language. ,(2016)
Yu-Wei Lin, Yuqian Zhou, Faraz Faghri, Michael J Shaw, Roy H Campbell, None, Analysis and Prediction of Unplanned Intensive Care Unit Readmission using Recurrent Neural Networks with Long Short-Term Memory bioRxiv. pp. 385518- ,(2018) , 10.1101/385518
Sarthak Jain, Byron C. Wallace, Attention is not Explanation. north american chapter of the association for computational linguistics. pp. 3543- 3556 ,(2019) , 10.18653/V1/N19-1357
Shi Feng, Eric Wallace, Alvin Grissom II, Mohit Iyyer, Pedro Rodriguez, Jordan Boyd-Graber, Pathologies of Neural Models Make Interpretations Difficult empirical methods in natural language processing. pp. 3719- 3728 ,(2018) , 10.18653/V1/D18-1407
Andrew Slavin Ross, Michael C. Hughes, Finale Doshi-Velez, Right for the Right Reasons: Training Differentiable Models by Constraining their Explanations. international joint conference on artificial intelligence. pp. 2662- 2670 ,(2017) , 10.24963/IJCAI.2017/371
Yoshua Bengio, Kyunghyun Cho, Dzmitry Bahdanau, Neural Machine Translation by Jointly Learning to Align and Translate international conference on learning representations. ,(2015)