FS 3 : A sampling based method for top-k frequent subgraph mining

作者: Tanay Kumar Saha , Mohammad Al Hasan

DOI: 10.1109/BIGDATA.2014.7004359

关键词:

摘要: Mining labeled subgraph is a popular research task in data mining because of its potential application many different scientific domains. All the existing methods for this explicitly or implicitly solve isomorphism which computationally expensive, so they suffer from lack scalability problem when graphs input database are large. In work, we propose FS3, sampling based method. It mines small collection subgraphs that most frequent probabilistic sense. FS3 performs Markov Chain Monte Carlo (MCMC) over space fixed-size such potentially sampled more often. Besides, equipped with an innovative queue manager. stores finite course manner top-k positions contain subgraphs. Our experiments on large show efficient, and it obtains amongst given size.

参考文章(24)
Mohammad Al Hasan, Mohammed Javeed Zaki, MUSK: Uniform sampling of k maximal patterns siam international conference on data mining. pp. 650- 661 ,(2009)
Marc Wörlein, Thorsten Meinl, Ingrid Fischer, Michael Philippsen, A Quantitative Comparison of the Subgraph Miners MoFa, gSpan, FFSM, and Gaston Knowledge Discovery in Databases: PKDD 2005. pp. 392- 403 ,(2005) , 10.1007/11564126_39
Marisa Thoma, Hong Cheng, Arthur Gretton, Jiawei Han, Hans-Peter Kriegel, Alex Smola, Le Song, Philip S. Yu, Xifeng Yan, Karsten Borgwardt, Near-optimal supervised feature selection among frequent subgraphs siam international conference on data mining. pp. 1076- 1087 ,(2009) , 10.1137/1.9781611972795.92
Björn Bringmann, Albrecht Zimmermann, Luc De Raedt, Siegfried Nijssen, Don’t Be Afraid of Simpler Patterns Lecture Notes in Computer Science. ,vol. 4213, pp. 55- 66 ,(2006) , 10.1007/11871637_10
Akihiro Inokuchi, Takashi Washio, Hiroshi Motoda, An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data european conference on principles of data mining and knowledge discovery. pp. 13- 23 ,(2000) , 10.1007/3-540-45372-5_2
R. Vijayalakshmi, R. Nadarajan, John F. Roddick, M. Thilaga, P. Nirmala, FP-GraphMiner-A Fast Frequent Pattern Mining Algorithm for Network Graphs Journal of Graph Algorithms and Applications. ,vol. 15, pp. 753- 776 ,(2011) , 10.7155/JGAA.00247
Lini T. Thomas, Satyanarayana R. Valluri, Kamalakar Karlapalem, MARGIN: Maximal frequent subgraph mining ACM Transactions on Knowledge Discovery From Data. ,vol. 4, pp. 10- ,(2010) , 10.1145/1839490.1839491
Siegfried Nijssen, Joost N. Kok, The Gaston Tool for Frequent Subgraph Mining Electronic Notes in Theoretical Computer Science. ,vol. 127, pp. 77- 87 ,(2005) , 10.1016/J.ENTCS.2004.12.039
Xifeng Yan, Jiawei Han, CloseGraph: mining closed frequent graph patterns knowledge discovery and data mining. pp. 286- 295 ,(2003) , 10.1145/956750.956784