Learning to Diagnose with LSTM Recurrent Neural Networks

Authors: Charles Elkan, Zachary C. Lipton, David C. Kale, Randall Wetzel

Abstract: Clinical medical data, especially in the intensive care unit (ICU), consist of multivariate time series of observations. For each patient visit (or episode), sensor data and lab test results are recorded in the patient's Electronic Health Record (EHR). While potentially containing a wealth of insights, the data is difficult to mine effectively, owing to varying length, irregular sampling and missing data. Recurrent Neural Networks (RNNs), particularly those using Long Short-Term Memory (LSTM) hidden units, are powerful and increasingly popular models for learning from sequence data. They effectively model varying length sequences and capture long range dependencies. We present the first study to empirically evaluate the ability of LSTMs to recognize patterns in multivariate time series of clinical measurements. Specifically, we consider multilabel classification of diagnoses, training a model to classify 128 diagnoses given 13 frequently but irregularly sampled clinical measurements. First, we establish the effectiveness of a simple LSTM network for modeling clinical data. Then we demonstrate a straightforward and effective training strategy in which we replicate targets at each sequence step. Trained only on raw time series, our models outperform several strong baselines, including a multilayer perceptron trained on hand-engineered features.
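The target replication strategy mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the form of the loss, so everything below is an assumption made for illustration: the per-step loss is taken to be a mean binary cross-entropy over labels, the episode-level labels are copied to every sequence step, and a hypothetical mixing weight `alpha` blends the average per-step loss with the final-step loss. The function names are likewise invented for this example and are not taken from the paper.

```python
import math


def binary_cross_entropy(probs, targets, eps=1e-12):
    """Mean binary cross-entropy over the labels of one multilabel prediction."""
    total = 0.0
    for p, y in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(targets)


def target_replication_loss(step_probs, targets, alpha=0.5):
    """Blend the average per-step loss with the final-step loss.

    step_probs: list of T per-step predictions, each a list of label probabilities.
    targets:    episode-level label vector, reused (replicated) at every step.
    alpha:      hypothetical mixing weight in [0, 1]; alpha=0 uses only the final step.
    """
    per_step = [binary_cross_entropy(p, targets) for p in step_probs]
    mean_over_steps = sum(per_step) / len(per_step)
    return alpha * mean_over_steps + (1.0 - alpha) * per_step[-1]


# Toy episode: 3 sequence steps and 4 labels (the task in the abstract has 128).
targets = [1, 0, 0, 1]
step_probs = [
    [0.5, 0.5, 0.5, 0.5],  # early step: uninformative prediction
    [0.7, 0.3, 0.4, 0.6],
    [0.9, 0.1, 0.2, 0.8],  # final step: most confident prediction
]
loss = target_replication_loss(step_probs, targets, alpha=0.5)
```

With `alpha=0` the sketch reduces to scoring only the final step's prediction. Larger values of `alpha` also penalize poor predictions at earlier steps, which gives every step a local error signal during training.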
