MSGPs: A Novel Algorithm for Mining Sequential Generator Patterns

Authors: Thi-Thiet Pham, Jiawei Luo, Tzung-Pei Hong, Bay Vo

DOI: 10.1007/978-3-642-34707-8_40

Abstract: Sequential generator pattern mining is an important task in data mining. Sequential generator patterns used together with closed sequential patterns can provide additional information that closed sequential patterns alone are not able to provide. In this paper, we propose an algorithm called MSGPs, which is based on the characteristics of sequential generator patterns and on sequence extensions, and which performs a depth-first search on a prefix tree to find all sequential generator patterns. The algorithm uses a vertical approach to listing and counting support, and relies on prime block encoding, grounded in prime factorization theory, to represent candidate sequences and determine the frequency of each candidate. Experimental results showed that the proposed algorithm is effective.
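
To make the prime block encoding idea concrete, the following is a minimal sketch and not the MSGPs implementation itself. It assumes a block of 8 sequence IDs, each mapped to one of the first 8 primes; the function names (`encode_block`, `decode_block`, `intersect`, `support`) and the block size are hypothetical choices made for illustration. A set of sequence IDs is stored as the product of its primes, so intersecting two vertical ID lists reduces to a greatest common divisor, and the support is the number of distinct prime factors.

```python
from math import gcd

# Assumption for this sketch: one block covers 8 sequence IDs (0..7),
# and ID i is represented by the i-th prime.
PRIMES = [2, 3, 5, 7, 11, 13, 17, 19]


def encode_block(ids):
    """Encode a set of sequence IDs as the product of their primes."""
    code = 1
    for i in set(ids):
        code *= PRIMES[i]
    return code


def decode_block(code):
    """Recover the sequence IDs whose primes divide the code."""
    return [i for i, prime in enumerate(PRIMES) if code % prime == 0]


def intersect(code_a, code_b):
    """Intersect two encoded ID sets: the gcd keeps only the shared primes."""
    return gcd(code_a, code_b)


def support(code):
    """Number of sequence IDs present in an encoded block."""
    return len(decode_block(code))


# Hypothetical vertical ID lists for two items.
a = encode_block([0, 1, 3, 5])   # 2 * 3 * 7 * 13
b = encode_block([1, 2, 5, 7])   # 3 * 5 * 13 * 19
both = intersect(a, b)           # shared primes: 3 * 13
print(decode_block(both), support(both))
```

This sketch only tracks which sequences contain both items. A full sequence miner would also have to encode item positions inside each sequence to check ordering, which is omitted here.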
