Leveraging Neighbor Attributes for Classification in Sparsely Labeled Networks

作者: Luke K. McDowell , David W. Aha

DOI: 10.1145/2898358

关键词:

摘要: Many analysis tasks involve linked nodes, such as people connected by friendship links. Research on link-based classification (LBC) has studied how to leverage these connections improve accuracy. Most prior research assumed the provision of a densely labeled training network. Instead, this article studies common and challenging case when LBC must use single sparsely network for both learning inference, where existing methods often yield poor To address challenge, we introduce novel method that enables prediction via “neighbor attributes,” which were briefly considered early work but then abandoned due perceived problems. We explain, using extensive experiments loss decomposition analysis, neighbor attributes significantly improves further show appropriate semi-supervised (SSL) is essential obtaining best accuracy in domain gains remain across range SSL choices data conditions. Finally, given challenges label sparsity impact attributes, multiple previous be re-considered, including regarding model features, noisy strategies active learning.

参考文章(83)
Julian Besag, Statistical Analysis of Non-Lattice Data The Statistician. ,vol. 24, pp. 179- 195 ,(1975) , 10.2307/2987782
Aditya Krishna Menon, Charles Elkan, Link prediction via matrix factorization european conference on machine learning. pp. 437- 452 ,(2011) , 10.1007/978-3-642-23783-6_28
Tanwistha Saha, Huzefa Rangwala, Carlotta Domeniconi, FLIP: active learning for relational network classification european conference on machine learning. pp. 1- 18 ,(2014) , 10.1007/978-3-662-44845-8_1
Galileo Namata, Ben London, Lise Getoor, Bert Huang, U Edu, Query-driven Active Surveying for Collective Classification ,(2012)
Daniel Lowd, MohamadAli Torkamani, Convex Adversarial Collective Classification international conference on machine learning. pp. 642- 650 ,(2013)
Lise Getoor, Stephen H. Bach, Bert Huang, Unifying Local Consistency and MAX SAT Relaxations for Scalable Inference with Rounding Guarantees international conference on artificial intelligence and statistics. pp. 46- 55 ,(2015)
Semi-Supervised Learning Advanced Methods in Sequence Analysis Lectures. pp. 221- 232 ,(2010) , 10.7551/MITPRESS/9780262033589.001.0001
Lise Getoor, Stephen Bach, Bert Huang, Ben London, Hinge-loss Markov Random Fields: Convex Inference for Structured Prediction arXiv: Learning. ,(2013)
Julian Besag, On the statistical analysis of dirty pictures Journal of the royal statistical society series b-methodological. ,vol. 48, pp. 259- 279 ,(1986) , 10.1111/J.2517-6161.1986.TB01412.X
R. A. Rossi, L. K. McDowell, D. W. Aha, J. Neville, Transforming graph data for statistical relational learning Journal of Artificial Intelligence Research. ,vol. 45, pp. 363- 441 ,(2012) , 10.1613/JAIR.3659