Experimental comparison of representation methods and distance measures for time series data

Authors: Xiaoyue Wang, Abdullah Mueen, Hui Ding, Goce Trajcevski, Peter Scheuermann

DOI: 10.1007/S10618-012-0250-5

Keywords: Representation (mathematics), Distance measures, Series (mathematics), Artificial intelligence, Similarity (psychology), Computer science, Machine learning, Context (language use), Variety (cybernetics), Dimensionality reduction, Time series

Abstract: The previous decade has brought a remarkable increase of the interest in applications that deal with querying and mining of time series data. Many of the research efforts in this context have focused on introducing new representation methods for dimensionality reduction or novel similarity measures for the underlying data. In the vast majority of cases, each individual work introducing a particular method has made specific claims and, aside from the occasional theoretical justifications, provided quantitative experimental observations. However, for the most part, the comparative aspects of these experiments were too narrowly focused on demonstrating the benefits of the proposed methods over some of the previously introduced ones. In order to provide a comprehensive validation, we conducted an extensive experimental study re-implementing eight different time series representations and nine similarity measures and their variants, and testing their effectiveness on 38 time series data sets from a wide variety of application domains. In this article, we give an overview of these different techniques and present our comparative experimental findings regarding their effectiveness. In addition to providing a unified validation of some of the existing achievements, our experiments also indicate that, in some cases, certain claims in the literature may be unduly optimistic.
