Adaptive piecewise and symbolic aggregate approximation as an improved representation method for heat waves detection

作者: Aida A. Ferreira , Iona M. B. Rameh Barbosa , Ronaldo R. B. Aquino , Herrera Manuel , Sukumar Natarajan

DOI: 10.1007/978-3-030-01174-1_51

关键词: AlgorithmComputer scienceCurse of dimensionalityDiscrete Fourier transformTime seriesDiscrete wavelet transformRepresentation (mathematics)PiecewiseVariable (computer science)Series (mathematics)

摘要: Mining time series has attracted an increasing interest due to its wide applications in finance, industry, biology, environment, and so on. In order reduce execution storage space, many high level representations or abstractions of the raw data have been proposed including Discrete Fourier Transform (DFT), Wavelet (DWT), Piecewise Aggregate Approximation (PAA) Symbolic approXimation (SAX). this paper, we introduce a novel adaptive piecewise symbolic aggregate approximation (APAA/ASAX) which creates segments variable length automatically adapt any segment local condition variability difference average value current values is defined. The each from APAA represented as symbol ordered alphabet generating modified version for SAX called (ASAX). This straightforwardly allows handle more versatile definition event duration. method APAA/ASAX was used locating heat waves patterns real-world datasets daily temperature information, year 1970 until 2009. experimental results show that representation able locate heatwave events huge databases. Advantages regarding traditional PAA are mainly based on being constrain-free fixed schemes length. It also highlights ability self-tuning depending characteristics. means flat proposes lower number dimensionality than case deal with variability. approach will be use those looking extreme series.

参考文章(22)
Durga Toshniwal, Feature extraction from time series data Journal of Computational Methods in Sciences and Engineering archive. ,vol. 9, pp. 99- 110 ,(2009) , 10.3233/JCM-2009-0239
Eamonn Keogh, Indexing and Mining Time Series Data encyclopedia of gis. pp. 493- 497 ,(2008) , 10.1007/978-3-319-23519-6_598-2
Jessica Lin, Yuan Li, Finding Structural Similarity in Time Series Data Using Bag-of-Patterns Representation statistical and scientific database management. pp. 461- 477 ,(2009) , 10.1007/978-3-642-02279-1_33
Eamonn Keogh, Jessica Lin, Clustering of time series subsequences is meaningless: implications for previous and future research international conference on data mining. ,vol. 8, pp. 154- 177 ,(2003) , 10.1007/S10115-004-0172-7
Jidong Yuan, Zhihai Wang, Meng Han, Yange Sun, A lazy associative classifier for time series intelligent data analysis. ,vol. 19, pp. 983- 1002 ,(2015) , 10.3233/IDA-150754
Gary M. Weiss, Mining with rarity ACM SIGKDD Explorations Newsletter. ,vol. 6, pp. 7- 19 ,(2004) , 10.1145/1007730.1007734
Chris Chatfield, Andreas S. Weigend, Time series prediction: Forecasting the future and understanding the past International Journal of Forecasting. ,vol. 10, pp. 161- 163 ,(1994) , 10.1016/0169-2070(94)90058-2
Faras Rasheed, Mohammed Alshalalfa, Reda Alhajj, Efficient Periodicity Mining in Time Series Databases Using Suffix Trees IEEE Transactions on Knowledge and Data Engineering. ,vol. 23, pp. 79- 94 ,(2011) , 10.1109/TKDE.2010.76
Xiaoyue Wang, Abdullah Mueen, Hui Ding, Goce Trajcevski, Peter Scheuermann, Eamonn Keogh, Experimental comparison of representation methods and distance measures for time series data Data Mining and Knowledge Discovery. ,vol. 26, pp. 275- 309 ,(2013) , 10.1007/S10618-012-0250-5
Philippe Esling, Carlos Agon, Time-series data mining ACM Computing Surveys. ,vol. 45, pp. 1- 34 ,(2012) , 10.1145/2379776.2379788