Community Detection via Random and Adaptive Sampling

作者: Alexandre Proutière , Se-Young Yun

DOI:

关键词:

摘要: In this paper, we consider networks consisting of a finite number non-overlapping communities. To extract these communities, the interaction between pairs nodes may be sampled from large available data set, which allows given node pair to several times. When is sampled, observed outcome binary random variable, equal 1 if interact and 0 otherwise. The more likely positive belong same For budget samples or observations, wish jointly design sampling strategy (the sequence pairs) clustering algorithm that recover hidden communities with highest possible accuracy. We both non-adaptive adaptive strategies, for classes derive fundamental performance limits satisfied by any algorithm. particular, provide necessary conditions existence algorithms recovering accurately as network size grows large. also devise simple reconstruct when at all possible, hence proving proposed accurate community detection are sufficient. classical problem in stochastic block model can seen particular instance problems here. But our framework covers general scenarios where designed an manner. paper provides new results model, extends analysis case sampling.

参考文章(24)
Ohad Shamir, Naftali Tishby, Spectral Clustering on a Budget international conference on artificial intelligence and statistics. pp. 661- 669 ,(2011)
Alexander Tsiatas, Kamalika Chaudhuri, Fan Chung, Spectral Clustering of Graphs with General Degrees in the Extended Planted Partition Model conference on learning theory. ,(2012)
Anirban Dasgupta, John Hopcroft, Ravi Kannan, Pradipta Mitra, Spectral Clustering by Recursive Partitioning Lecture Notes in Computer Science. pp. 256- 267 ,(2006) , 10.1007/11841036_25
F. McSherry, Spectral partitioning of random graphs international conference on cluster computing. pp. 529- 537 ,(2001) , 10.1109/SFCS.2001.959929
Laurent Massoulié, Marc Lelarge, Simon Heimlicher, Community Detection in the Labelled Stochastic Block Model arXiv: Social and Information Networks. ,(2012)
Aurelien Decelle, Florent Krzakala, Cristopher Moore, Lenka Zdeborová, Inference and phase transitions in the detection of modules in sparse networks. Physical Review Letters. ,vol. 107, pp. 065701- 065701 ,(2011) , 10.1103/PHYSREVLETT.107.065701
Elchanan Mossel, Joe Neeman, Allan Sly, Stochastic Block Models and Reconstruction arXiv: Probability. ,(2012)
Elchanan Mossel, Joe Neeman, Allan Sly, A Proof Of The Block Model Threshold Conjecture arXiv: Probability. ,(2013)
Ravi B. Boppana, Eigenvalues and graph bisection: An average-case analysis 28th Annual Symposium on Foundations of Computer Science (sfcs 1987). pp. 280- 285 ,(1987) , 10.1109/SFCS.1987.22
T.L Lai, Herbert Robbins, Asymptotically efficient adaptive allocation rules Advances in Applied Mathematics. ,vol. 6, pp. 4- 22 ,(1985) , 10.1016/0196-8858(85)90002-8