Text Clustering via Constrained Nonnegative Matrix Factorization

Authors: Yan Zhu, Liping Jing, Jian Yu

DOI: 10.1109/ICDM.2011.143

Keywords:

Abstract: Semi-supervised nonnegative matrix factorization (NMF) is receiving more and more attention in the text mining field. Semi-supervised NMF methods can be divided into two types: one is based on explicit category labels, the other on pairwise constraints, including must-link and cannot-link constraints. As it is hard to obtain category labels in some tasks, the latter is more widely used in real applications. To date, all constrained NMF methods treat must-link and cannot-link constraints in the same way. However, these two kinds of constraints play different roles in NMF clustering. Thus a novel constrained NMF method is proposed in this paper. In the new method, must-link constraints are used to control the distance of the data in the compressed form, and cannot-link constraints are used to control the encoding factor. Experimental results on real-world text data sets have shown the good performance of the proposed method.

References (16)
Changshui Zhang, Fei Wang, Tao Li. Semi-Supervised Clustering via Matrix Factorization. SIAM International Conference on Data Mining, pp. 1-12 (2008).
Zhaohui Wu, Haifeng Liu. Non-negative matrix factorization with constraints. National Conference on Artificial Intelligence, pp. 506-511 (2010).
Daniel D. Lee, H. Sebastian Seung. Learning the parts of objects by non-negative matrix factorization. Nature, vol. 401, pp. 788-791 (1999). DOI: 10.1038/44565
Wei Xu, Xin Liu, Yihong Gong. Document clustering based on non-negative matrix factorization. International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 267-273 (2003). DOI: 10.1145/860435.860485
Chris Ding, Tao Li, Wei Peng, Haesun Park. Orthogonal nonnegative matrix t-factorizations for clustering. Knowledge Discovery and Data Mining, pp. 126-135 (2006). DOI: 10.1145/1150402.1150420
Chih-Jen Lin. Projected Gradient Methods for Nonnegative Matrix Factorization. Neural Computation, vol. 19, pp. 2756-2779 (2007). DOI: 10.1162/NECO.2007.19.10.2756
Farial Shahnaz, Michael W Berry, V Paul Pauca, Robert J Plemmons. Document clustering using nonnegative matrix factorization. Information Processing and Management, vol. 42, pp. 373-386 (2006). DOI: 10.1016/J.IPM.2004.11.005
H. Sebastian Seung, Daniel D. Lee. Algorithms for Non-negative Matrix Factorization. Neural Information Processing Systems, vol. 13, pp. 556-562 (2000).
Yanhua Chen, Manjeet Rege, Ming Dong, Jing Hua. Non-negative matrix factorization for semi-supervised data clustering. Knowledge and Information Systems, vol. 17, pp. 355-379 (2008). DOI: 10.1007/S10115-008-0134-6