Optimal Construction of Multi-Dimensional Indexes in Time-Series Databases: A Physical Database Design Approach.

Authors: Sang-Wook Kim, Sanghyun Park, Byung-Ill Han, Jin-Ho Kim

Abstract: Similarity search in time-series databases is an operation that finds data sequences whose changing patterns are similar to that of a query sequence. Typically, it uses a multi-dimensional index for efficient processing. To alleviate the dimensionality curse, a problem in high-dimensional cases, previous methods for similarity search apply the Discrete Fourier Transform (DFT) to data sequences and take only the first two or three DFT coefficients as the organizing attributes of the multi-dimensional index. Other than this ad hoc approach, there has been no research effort on devising a systematic guideline for choosing the best organizing attributes among all DFT coefficients. This paper points out the problems that arise in previous methods and proposes a novel solution for constructing the optimal multi-dimensional index. The proposed method analyzes the characteristics of a target time-series database and then identifies the organizing attributes having the best discrimination power. Finally, it determines the optimal number of organizing attributes by using a cost model for similarity search. We show the effectiveness of the proposed method through a series of experiments.
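
To make the idea in the abstract concrete, the following is a minimal sketch in Python, not the authors' implementation. It shows the conventional feature extraction (keep the first k DFT coefficients) and one possible way to rank coefficients across a database. The abstract does not state how discrimination power is measured, so the variance-based ranking in `rank_coefficients_by_variance` is purely an illustrative assumption, and all function names are hypothetical.

```python
import cmath
import math


def dft(seq):
    """Naive O(n^2) DFT with 1/sqrt(n) scaling, so Euclidean distance is preserved."""
    n = len(seq)
    scale = 1.0 / math.sqrt(n)
    return [
        scale * sum(x * cmath.exp(-2j * math.pi * f * t / n) for t, x in enumerate(seq))
        for f in range(n)
    ]


def first_k_features(seq, k):
    """Conventional choice: use the first k DFT coefficients as index attributes."""
    return dft(seq)[:k]


def rank_coefficients_by_variance(sequences):
    """Rank coefficient positions by their variance over the database.

    Assumption for illustration only: variance stands in for the
    'discrimination power' mentioned in the abstract.
    """
    spectra = [dft(s) for s in sequences]
    m, n = len(spectra), len(spectra[0])
    scores = []
    for f in range(n):
        mean = sum(sp[f] for sp in spectra) / m
        var = sum(abs(sp[f] - mean) ** 2 for sp in spectra) / m
        scores.append((var, f))
    # Highest variance first; ties broken by coefficient position.
    return [f for _, f in sorted(scores, key=lambda p: (-p[0], p[1]))]


def euclidean(a, b):
    """Euclidean distance between two equal-length real or complex vectors."""
    return math.sqrt(sum(abs(x - y) ** 2 for x, y in zip(a, b)))
```

Because the scaled DFT preserves Euclidean distance, the distance computed on any subset of coefficients never exceeds the distance between the original sequences, which is what allows an index built on a few coefficients to filter candidates without false dismissals. Which coefficients to keep, and how many, is exactly the design question the paper addresses.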
