Mining the K-Most Interesting Frequent Patterns Sequentially

Authors: Quang Tran Minh, Shigeru Oyanagi, Katsuhiro Yamazaki

DOI: 10.1007/11875581_75

Abstract: Conventional frequent pattern mining algorithms require users to specify a minimum support threshold, which is not easy to identify without knowledge of the dataset in advance. This difficulty leads users to a dilemma: either they may lose useful information, or they may not be able to screen for the interesting knowledge among the huge sets of frequent patterns presented. Mining top-k frequent patterns allows users to control the number of patterns to be discovered for analysis. In this paper, we propose an optimized version of ExMiner, called OExMiner, to mine the top-k frequent patterns from a large-scale dataset efficiently and effectively. To improve the user-friendliness and the performance of the system, we also propose two other methods, extended from OExMiner, called Seq-Miner and Seq-BOMA, to mine top-k frequent patterns sequentially. Experiments on both synthetic and real data show that our proposed methods are much more efficient and effective than existing ones.
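To make the problem statement concrete, the following is a minimal brute-force sketch of top-k frequent itemset mining in Python. It is not OExMiner, ExMiner, Seq-Miner, or Seq-BOMA, whose internals are not described in this record; the function name `top_k_frequent_itemsets`, the `max_len` cap, and the toy transactions are assumptions made for illustration only. The point is that the user supplies k instead of a minimum support, and the support threshold falls out of the result.

```python
from collections import Counter
from itertools import combinations


def top_k_frequent_itemsets(transactions, k, max_len=3):
    """Return the k itemsets with the highest support (naive enumeration).

    Hypothetical helper for illustration: it enumerates every itemset of
    up to max_len items per transaction, so it only suits tiny datasets.
    """
    counts = Counter()
    for transaction in transactions:
        items = sorted(set(transaction))  # de-duplicate, fix item order
        for size in range(1, min(max_len, len(items)) + 1):
            for itemset in combinations(items, size):
                counts[itemset] += 1
    # Rank by support (descending); break ties by itemset for determinism.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return ranked[:k]


if __name__ == "__main__":
    # Toy transactions (assumed data, not from the paper's experiments).
    transactions = [
        ["a", "b", "c"],
        ["a", "b"],
        ["a", "c"],
        ["b", "c"],
        ["a"],
    ]
    top = top_k_frequent_itemsets(transactions, k=3)
    for itemset, support in top:
        print(itemset, support)
    # The support of the k-th pattern is the threshold the user never
    # had to guess in advance.
    print("implied minimum support:", top[-1][1])
```

With k = 3 on the toy data, the result is the three single items, and the implied minimum support is 3. A practical miner would avoid full enumeration, for example by raising an internal support bound as patterns are found, but how the paper's methods do this is not stated here.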

References (11)
J. Pei, Jiawei Han, Runying Mao. CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets. International Conference on Management of Data, pp. 21-30 (2000).
Gösta Grahne, Jianfei Zhu. Efficiently Using Prefix-trees in Mining Frequent Itemsets. FIMI (2003).
Ramakrishnan Srikant, Rakesh Agrawal. Fast Algorithms for Mining Association Rules in Large Databases. Very Large Data Bases, pp. 487-499 (1994).
Tran Minh Quang, Shigeru Oyanagi, Katsuhiro Yamazaki. ExMiner: An Efficient Algorithm for Mining Top-k Frequent Patterns. Advanced Data Mining and Applications, pp. 436-447 (2006). DOI: 10.1007/11811305_48
Ada Wai-chee Fu, Renfrew Wang-wai Kwong, Jian Tang. Mining N-most Interesting Itemsets. International Symposium on Methodologies for Intelligent Systems, pp. 59-67 (2000). DOI: 10.1007/3-540-39963-1_7
Jiawei Han, Jian Pei, Yiwen Yin. Mining Frequent Patterns without Candidate Generation. International Conference on Management of Data, vol. 29, pp. 1-12 (2000). DOI: 10.1145/335191.335372
Jiawei Han, Jianyong Wang, Ying Lu, P. Tzvetkov. Mining Top-k Frequent Closed Patterns without Minimum Support. International Conference on Data Mining, pp. 211-218 (2002). DOI: 10.1109/ICDM.2002.1183905
Jianyong Wang, J. Han, Y. Lu, P. Tzvetkov. TFP: An Efficient Algorithm for Mining Top-k Frequent Closed Itemsets. IEEE Transactions on Knowledge and Data Engineering, vol. 17, pp. 652-664 (2005). DOI: 10.1109/TKDE.2005.81
Roberto J. Bayardo. Efficiently Mining Long Patterns from Databases. Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data (SIGMOD '98), vol. 27, pp. 85-93 (1998). DOI: 10.1145/276304.276313
Rodney Topor, Li Shen, Hong Shen, Paul Pritchard. Finding the N Largest Itemsets. International Conference on Data Mining, pp. 211-222 (1998).