CloseGraph: mining closed frequent graph patterns

作者: Xifeng Yan , Jiawei Han

DOI: 10.1145/956750.956784

关键词: MathematicsData miningMolecule miningComplement graphExistential quantificationExponential numberGraph patternsData structureGraph (abstract data type)Epigraph

摘要: Recent research on pattern discovery has progressed form mining frequent itemsets and sequences to structured patterns including trees, lattices, graphs. As a general data structure, graph can model complicated relations among with wide applications in bioinformatics, Web exploration, etc. However, large challenging due the presence of an exponential number subgraphs. Instead all subgraphs, we propose mine closed patterns. A g is database if there exists no proper supergraph that same support as g. algorithm, CloseGraph, developed by exploring several interesting pruning methods. Our performance study shows CloseGraph not only dramatically reduces unnecessary subgraphs be generated but also substantially increases efficiency mining, especially

参考文章(19)
Lawrence B. Holder, Diane J. Cook, Surnjani Djoko, Substructure discovery in the SUBDUE system knowledge discovery and data mining. pp. 169- 180 ,(1994)
Thomas H. Cormen, Introduction to algorithms [2nd ed.] MIT Press. ,(2001)
J. Pei, Jiawei Han, Runying Mao, CLOSET : An Efficient Algorithm for Mining Frequent Closed Itemsets international conference on management of data. pp. 21- 30 ,(2000)
TH Cormen, RL Rivest, CE Leiserson, C Stein, Introduction to Algorithms, 2nd edition. ,(2001)
節夫 有川, 比呂志 坂本, 真治 川副, Setsuo Arikawa, 賢治 安部, 達哉 浅井, 博紀 有村, Shinji Kawasoe, Kenji Abe, Hiroshi Sakamoto, Hiroki Arimura, Tatsuya Asai, Efficient Substructure Discovery from Large Semi-structed Data DOI Technical Report. ,vol. 200, ,(2001)
Ramakrishnan Srikant, Rakesh Agrawal, Fast Algorithms for Mining Association Rules in Large Databases very large data bases. pp. 487- 499 ,(1994)
Mohammed Javeed Zaki, Ching-Jiu Hsiao, CHARM : An Efficient Algorithm for Closed Itemset Mining siam international conference on data mining. pp. 457- 473 ,(2002)
Nicolas Pasquier, Yves Bastide, Rafik Taouil, Lotfi Lakhal, Discovering Frequent Closed Itemsets for Association Rules international conference on database theory. ,vol. 1540, pp. 398- 416 ,(1999) , 10.1007/3-540-49257-7_25
Akihiro Inokuchi, Takashi Washio, Hiroshi Motoda, An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data european conference on principles of data mining and knowledge discovery. pp. 13- 23 ,(2000) , 10.1007/3-540-45372-5_2
Jiawei Han, Ramin Afshar, Xifeng Yan, CloSpan: Mining Closed Sequential Patterns in Large Databases. siam international conference on data mining. pp. 166- 177 ,(2003)