Online Mining of Maximal Frequent Itemsequences from Data Streams

Authors: Xindong Wu, Yue Sun, Xingquan Zhu, Guojun Mao, Chunnian Liu

Abstract: Mining data streams often requires real-time extraction of interesting patterns from dynamic and continuously growing data. This requirement has imposed challenges on discovering and outputting currently useful patterns in an instant way, commonly referred to as online streaming data mining. In this paper, we present INSTANT, a novel algorithm that explores maximal frequent itemsequences from streaming data in an online fashion. We first provide useful operators on the lattice of itemsequential sets, and then apply them to the design of INSTANT. In comparison with the most popular methods, such as closed-itemset-based mining algorithms, INSTANT has solid theoretical foundations to ensure that it employs more compact in-memory data structures than closed itemsequences. Experimental results show that our method can achieve better performance than previous related methods in terms of both time and space efficiency.
