Discovering Communities and Anomalies in Attributed Graphs: Interactive Visual Exploration and Summarization

作者: Bryan Perozzi , Leman Akoglu

DOI: 10.1145/3139241

关键词:

摘要: Given a network with node attributes, how can we identify communities and spot anomalies? How characterize, describe, or summarize the in succinct way? Community extraction requires measure of quality for connected subgraphs (e.g., social circles). Existing subgraph measures, however, either consider only connectedness nodes inside community ignore cross-edges at boundary density) quantify structure attributes conductance). In this work, focus on node-attributed networks introduce: (1) new attributed called normality, (2) algorithm that uses normality to extract few characterizing per community, (3) summarization interactive visualization approach graph exploration. More specifically, first introduce to normality an subgraph. Our normality measure carefully utilizes together both internal consistency external separability. We then formulate objective function automatically infer (called “focus”) respective attribute weights, so as maximize normality  score given Most notably, unlike many other approaches, our allows long they be “exonerated;” i.e., (i) are expected under null model, and/or (ii) their do not exhibit attributes. Next, propose AMEN (for Attributed Mining Entity Networks), simultaneously discovers graph, goal total normality. Communities which yields high cannot found considered low anomalous. Last, task multi-criteria objective, selects subset cover entire well, (iii) diverse further design interface presents user interpretable, user-friendly fashion. The explore all communities, analyze various algorithm-generated summaries, well devise own summaries interactively characterize way. As experiments real-world graphs show, proposed approaches effectively find anomalous outperform several existing measures methods, such conductance, density, OddBall, SODA. also conduct extensive studies capability efficiency provides users toward summarization, exploration, sensemaking.

参考文章(90)
A. V. Goldberg, Finding a Maximum Density Subgraph University of California at Berkeley. ,(1984)
Hiroaki Shiokawa, Yasuhiro Fujiwara, Makoto Onizuka, Fast algorithm for modularity-based graph clustering national conference on artificial intelligence. pp. 1170- 1176 ,(2013)
Leman Akoglu, Mary McGlohon, Christos Faloutsos, OddBall: spotting anomalies in weighted graphs knowledge discovery and data mining. ,vol. 6119, pp. 410- 421 ,(2010) , 10.1007/978-3-642-13672-6_40
Moses Charikar, Greedy approximation algorithms for finding dense components in a graph Lecture Notes in Computer Science. pp. 84- 95 ,(2000) , 10.1007/3-540-44436-X_10
M. Girvan, M. Girvan, M.E.J. Newman, M.E.J. Newman, Mixing Patterns and Community Structure in Networks Statistical Mechanics of Complex Networks. ,vol. 625, pp. 66- 87 ,(2003) , 10.1007/978-3-540-44943-0_5
Mark Newman, Networks: An Introduction ,(2010)
Qiaozhu Mei, Meng Qu, Mingzhe Wang, Jian Tang, Ming Zhang, Jun Yan, LINE: Large-scale Information Network Embedding the web conference. pp. 1067- 1077 ,(2015) , 10.1145/2736277.2741093
Stephan Gunnemann, Ines Farber, Sebastian Raubach, Thomas Seidl, Spectral Subspace Clustering for Graphs with Feature Vectors international conference on data mining. pp. 231- 240 ,(2013) , 10.1109/ICDM.2013.110
Cody Dunne, Ben Shneiderman, Motif simplification: improving network visualization readability with fan, connector, and clique glyphs human factors in computing systems. pp. 3247- 3256 ,(2013) , 10.1145/2470654.2466444
DARONG LAI, XIANGJUN WU, HONGTAO LU, CHRISTINE NARDINI, LEARNING OVERLAPPING COMMUNITIES IN COMPLEX NETWORKS VIA NON-NEGATIVE MATRIX FACTORIZATION International Journal of Modern Physics C. ,vol. 22, pp. 1173- 1190 ,(2011) , 10.1142/S0129183111016816