HCDF: A Hybrid Community Discovery Framework

作者: Tina Eliassi-Rad , Spiros Papadimitriou , Keith Henderson , Christos Faloutsos

DOI:

关键词:

摘要: We introduce a novel Bayesian framework for hybrid community discovery in graphs. Our framework,HCDF (short Hybrid Community Discovery Framework), can effectively incorporate hints from number of other detection algorithms and produce results that outperform the constituent parts. describe two HCDF-based approaches which are: (1) effective, terms link prediction performance robustness to small perturbations network structure; (2) consistent, effectiveness across various application domains; (3) scalable very large graphs; (4) nonparametric. extensive evaluation on collection diverse real-world graphs, with millions links, show our (a) achieve up 0.22 improvement as measured by area under ROC curve (AUC), (b) never have an AUC drops below 0.91 worst case, (c) find communities are robust structure defined Variation Information (an entropybased distance metric).

参考文章(25)
M. Stone, An Asymptotic Equivalence of Choice of Model by Cross-Validation and Akaike's Criterion Journal of the Royal Statistical Society: Series B (Methodological). ,vol. 39, pp. 44- 47 ,(1977) , 10.1111/J.2517-6161.1977.TB01603.X
Martin Ester, Zengjian Hu, Byron J. Gao, Boaz Ben-Moshe, Rong Ge, Joint Cluster Analysis of Attribute Data and Relationship Data: the Connected k-Center Problem. siam international conference on data mining. pp. 246- 257 ,(2006)
Fan R K Chung, Spectral Graph Theory ,(1996)
David M Blei, Andrew Y Ng, Michael I Jordan, None, Latent dirichlet allocation Journal of Machine Learning Research. ,vol. 3, pp. 993- 1022 ,(2003) , 10.5555/944919.944937
Flavia Moser, Rong Ge, Martin Ester, Joint cluster analysis of attribute and relationship data withouta-priori specification of the number of clusters Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '07. pp. 510- 519 ,(2007) , 10.1145/1281192.1281248
Rong Ge, Martin Ester, Byron J. Gao, Zengjian Hu, Binay Bhattacharya, Boaz Ben-Moshe, Joint cluster analysis of attribute data and relationship data ACM Transactions on Knowledge Discovery from Data. ,vol. 2, pp. 1- 35 ,(2008) , 10.1145/1376815.1376816
Ding Zhou, Eren Manavoglu, Jia Li, C. Lee Giles, Hongyuan Zha, Probabilistic models for discovering e-communities Proceedings of the 15th international conference on World Wide Web - WWW '06. pp. 173- 182 ,(2006) , 10.1145/1135777.1135807
Gary William Flake, Steve Lawrence, C. Lee Giles, Efficient identification of Web communities knowledge discovery and data mining. pp. 150- 160 ,(2000) , 10.1145/347090.347121
Jure Ferlez, Christos Faloutsos, Jure Leskovec, Dunja Mladenic, Marko Grobelnik, Monitoring Network Evolution using MDL international conference on data engineering. pp. 1328- 1330 ,(2008) , 10.1109/ICDE.2008.4497545
Aaron Clauset, M. E. J. Newman, Cristopher Moore, Finding community structure in very large networks. Physical Review E. ,vol. 70, pp. 066111- ,(2004) , 10.1103/PHYSREVE.70.066111