Towards a minimum description length based stopping criterion for semi-supervised time series classification

作者: Nurjahan Begum , Bing Hu , Thanawin Rakthanmanon , Eamonn Keogh

DOI: 10.1109/IRI.2013.6642490

关键词:

摘要: In the last decade plunging costs of sensors/storage have made it possible to obtain vast amounts medical telemetry. However for this data be useful, must annotated. This annotation, requiring attention experts is very expensive and time consuming, remains critical bottleneck in analysis. Semi-supervised learning an obvious way mitigate need human labor, however, most such algorithms are designed intrinsically discrete objects, do not work well domain, which requires ability deal with real-valued objects arriving a streaming fashion. we make two contributions. First, demonstrate that many cases just handful annotated examples sufficient perform accurate classification. Second, devise novel parameter-free stopping criterion semi-supervised learning. We evaluate our comprehensive set experiments on diverse sources including electrocardiograms. Our experimental results show approach can construct classifiers even if given only single instance.

参考文章(24)
Chotirat (Ann) Ratanamahatana, Eamonn J. Keogh, Making Time-Series Classification More Accurate Using Learned Constraints. siam international conference on data mining. pp. 11- 22 ,(2004)
Mirjana Ivanovic, Alexandros Nanopoulos, Milos Radovanovic, Time-Series Classification in Many Intrinsic Dimensions. siam international conference on data mining. pp. 677- 688 ,(2010)
Minh Nhut Nguyen, Xiao-Li Li, See-Kiong Ng, None, Ensemble Based Positive Unlabeled Learning for Time Series Classification Database Systems for Advanced Applications. pp. 243- 257 ,(2012) , 10.1007/978-3-642-29038-1_19
Semi-Supervised Learning Advanced Methods in Sequence Analysis Lectures. pp. 221- 232 ,(2010) , 10.7551/MITPRESS/9780262033589.001.0001
Chotirat Ann Ratanamahatana, Dechawut Wanichsan, Stopping Criterion Selection for Efficient Semi-supervised Time Series Classification software engineering, artificial intelligence, networking and parallel/distributed computing. pp. 1- 14 ,(2008) , 10.1007/978-3-540-70560-4_1
Peter Grünwald, A Tutorial Introduction to the Minimum Description Length Principle arXiv: Statistics Theory. ,(2004)
Pierre Geurts, Pattern Extraction for Time Series Classification european conference on principles of data mining and knowledge discovery. pp. 115- 127 ,(2001) , 10.1007/3-540-44794-6_10
S.D. Greenwald, R.S. Patil, R.G. Mark, Improved detection and classification of arrhythmias in noise-corrupted electrocardiograms using contextual information [1990] Proceedings Computers in Cardiology. pp. 461- 464 ,(1990) , 10.1109/CIC.1990.144257
Biju P. Simon, C. Eswaran, An ECG classifier designed using modified decision based neural networks Computers and Biomedical Research. ,vol. 30, pp. 257- 272 ,(1997) , 10.1006/CBMR.1997.1446