Mining Clinical Data with a Temporal Dimension: A Case Study

作者: Dan He , Xindong Wu , Xingquan Zhu

DOI: 10.1109/BIBM.2007.48

关键词:

摘要: Clinical databases store large amounts of information about patients and their medical conditions. Data mining techniques can extract relationships patterns holding in this wealth data, thus be helpful understanding the progression diseases efficacy associated therapies. A typical structure data is a sequence observations clinical parameters taken at different time moments. In kind contexts, temporal dimension fundamental variable that should account process returned as part extracted knowledge. Therefore, classical well established framework sequential pattern not enough, because it only focuses on sequentiality events, without extracting elapsing between two particular events. Time-annotated sequences (IAS), novel paradigm solves problem. Recently defined our laboratory together with an efficient algorithm for them, IAS are where each transition events annotated found frequent data. paper we report real-world case study, which applied to regarding set follow-up liver transplantation. The aim analysis assessing effectiveness extracorporeal photopheresis (ECP) therapy prevent rejection solid organ For patient, biochemical variables recorded moments after show values interleukins other specific dates, from possible physician assess ECP therapy. We believe study does interestingness context but, more ambitiously, suggests general methodology mining, whenever important problem analysis.

参考文章(27)
Sourav S. Bhowmick, Qiankun Zhao, Sequential Pattern Mining: A Survey ,(2003)
ROBERT M. KNOBLER, WINFRIED GRANINGER, ANDREAS LINDMAIER, FRANZ TRAUTINGER, Photopheresis for the Treatment of Lupus Erythematosus Annals of the New York Academy of Sciences. ,vol. 636, pp. 340- 356 ,(1991) , 10.1111/J.1749-6632.1991.TB33464.X
Behzad Mortazavi-Asl, Umeshwar Dayal, Qiming Chen, Jiawei Han, Jian Pei, Meichun Hsu, Helen Pinto, PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth international conference on data engineering. pp. 215- 224 ,(2001)
Michael S. Waterman, Introduction to computational biology : Chapman & Hall. ,(1995) , 10.1007/978-1-4899-6846-3
Mohammed J. Zaki, SPADE: An Efficient Algorithm for Mining Frequent Sequences Machine Learning. ,vol. 42, pp. 31- 60 ,(2001) , 10.1023/A:1007652502315
Ramakrishnan Srikant, Rakesh Agrawal, Mining sequential patterns: Generalizations and performance improvements Advances in Database Technology — EDBT '96. pp. 1- 17 ,(1996) , 10.1007/BFB0014140
Yunyue Zhu, Dennis Shasha, StatStream: statistical monitoring of thousands of data streams in real time very large data bases. pp. 358- 369 ,(2002) , 10.1016/B978-155860869-6/50039-1
Yang-Sae Moon, Kyu-Young Whang, Wook-Shin Han, General match Proceedings of the 2002 ACM SIGMOD international conference on Management of data - SIGMOD '02. pp. 382- 393 ,(2002) , 10.1145/564691.564735
Fosca Giannotti, Mirco Nanni, Fabio Pinelli, Dino Pedreschi, Trajectory pattern mining Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '07. pp. 330- 339 ,(2007) , 10.1145/1281192.1281230