Subset Infinite Relational Models

作者: Katsuhiko Ishiguro , Hiroshi Sawada , Naonori Ueda

DOI:

关键词: Logical data modelSocial networkRelational databasePairwise comparisonData miningCluster (physics)Computer scienceData entryLatent variablePrior probability

摘要: We propose a new probabilistic generative model for analyzing sparse and noisy pairwise relational data, such as friend-links on social network services customer records in online shops. Real-world data often include large portion of non-informative entries. Many existing stochastic blockmodels suffer from these irrelevant entries because their rather simpler forms priors. The proposed incorporates latent variable that explicitly indicates whether each entry is relevant or not to diminish bad effects associated with data. Through experiments using synthetic real sets, we show the can extract clusters stronger relations among within cluster than obtained by conventional model.

参考文章(19)
Bryan Klimt, Yiming Yang, The enron corpus: a new dataset for email classification research european conference on machine learning. pp. 217- 226 ,(2004) , 10.1007/978-3-540-30115-8_22
Wenjie Fu, Le Song, Eric P. Xing, Dynamic mixed membership blockmodel for evolving networks Proceedings of the 26th Annual International Conference on Machine Learning - ICML '09. pp. 329- 336 ,(2009) , 10.1145/1553374.1553416
Lei Tang, Huan Liu, Jianping Zhang, Zohreh Nazeri, Community evolution in dynamic multi-mode networks Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD 08. pp. 677- 685 ,(2008) , 10.1145/1401890.1401972
Peter D. Hoff, Model-based subspace clustering Bayesian Analysis. ,vol. 1, pp. 321- 344 ,(2006) , 10.1214/06-BA111
Carlos M. Carvalho, Jeffrey Chang, Joseph E. Lucas, Joseph R. Nevins, Quanli Wang, Mike West, High-dimensional sparse factor modeling: Applications in gene expression genomics Journal of the American Statistical Association. ,vol. 103, pp. 1438- 1456 ,(2008) , 10.1198/016214508000000869
David Blackwell, James B. MacQueen, Ferguson Distributions Via Polya Urn Schemes Annals of Statistics. ,vol. 1, pp. 353- 355 ,(1973) , 10.1214/AOS/1176342372
Thomas L. Griffiths, Naonori Ueda, Joshua B. Tenenbaum, Takeshi Yamada, Charles Kemp, Learning systems of concepts with an infinite relational model national conference on artificial intelligence. ,vol. 1, pp. 381- 388 ,(2006)
Katsuhiko Ishiguro, Tomoharu Iwata, Naonori Ueda, Joshua B. Tenenbaum, Dynamic Infinite Relational Model for Time-varying Relational Data Analysis neural information processing systems. ,vol. 23, pp. 919- 927 ,(2010)
Thomas L Griffiths, Zoubin Ghahramani, None, The Indian Buffet Process: An Introduction and Review Journal of Machine Learning Research. ,vol. 12, pp. 1185- 1224 ,(2011) , 10.5555/1953048.2021039