An efficient algorithm for mining time interval-based patterns in large database

作者: Yi-Cheng Chen , Ji-Chiang Jiang , Wen-Chih Peng , Suh-Yin Lee

DOI: 10.1145/1871437.1871448

关键词:

摘要: Most studies on sequential pattern mining are mainly focused time point-based event data. Few research efforts have elaborated patterns from interval-based However, in many real applications, usually persists for an interval of time. Since the relationships among intervals intrinsically complex, large database is really a challenging problem. In this paper, novel approach, named as incision strategy and new representation, called coincidence representation proposed to simplify processing complex relations intervals. Then, efficient algorithm, CTMiner (Coincidence Temporal Miner) developed discover frequent time-interval based patterns. The algorithm also employs two pruning techniques reduce search space effectively. Furthermore, experimental results show that not only scalable but outperforms state-of-the-art algorithms.

参考文章(14)
Po-shan Kam, Ada Wai-chee Fu, Discovering Temporal Patterns for Interval-Based Events data warehousing and knowledge discovery. pp. 317- 326 ,(2000) , 10.1007/3-540-44466-1_32
F. Masseglia, F. Cathala, P. Poncelet, The PSP approach for mining sequential patterns Principles of Data Mining and Knowledge Discovery. pp. 176- 184 ,(1998) , 10.1007/BFB0094818
Behzad Mortazavi-Asl, Umeshwar Dayal, Qiming Chen, Jiawei Han, Jian Pei, Meichun Hsu, Helen Pinto, PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth international conference on data engineering. pp. 215- 224 ,(2001)
Mohammed J. Zaki, SPADE: An Efficient Algorithm for Mining Frequent Sequences Machine Learning. ,vol. 42, pp. 31- 60 ,(2001) , 10.1023/A:1007652502315
Ramakrishnan Srikant, Rakesh Agrawal, Mining sequential patterns: Generalizations and performance improvements Advances in Database Technology — EDBT '96. pp. 1- 17 ,(1996) , 10.1007/BFB0014140
Frank Höppner, Frank Klawonn, Finding Informative Rules in Interval Sequences intelligent data analysis. ,vol. 6, pp. 125- 134 ,(2001) , 10.1007/3-540-44816-0_13
Edi Winarko, John F. Roddick, ARMADA - An algorithm for discovering richer relative temporal association rules from interval-based data data and knowledge engineering. ,vol. 63, pp. 76- 90 ,(2007) , 10.1016/J.DATAK.2006.10.009
Fabian Mörchen, Alfred Ultsch, Efficient mining of understandable patterns from multivariate interval time series Data Mining and Knowledge Discovery. ,vol. 15, pp. 181- 215 ,(2007) , 10.1007/S10618-007-0070-1
Dhaval Patel, Wynne Hsu, Mong Li Lee, Mining relationships among interval-based events for classification Proceedings of the 2008 ACM SIGMOD international conference on Management of data - SIGMOD '08. pp. 393- 404 ,(2008) , 10.1145/1376616.1376658
Shin-Yi Wu, Yen-Liang Chen, Mining Nonambiguous Temporal Patterns for Interval-Based Events IEEE Transactions on Knowledge and Data Engineering. ,vol. 19, pp. 742- 758 ,(2007) , 10.1109/TKDE.2007.190613