Probabilistic multi-label classification with sparse feature learning

作者: Wei Xue , Yuhong Guo

DOI:

关键词: Probabilistic logicMachine learningMulti-label classificationArtificial intelligenceEmpirical researchMathematicsPattern recognitionFeature learningText categorizationNorm (mathematics)Convex optimizationImage labeling

摘要: Multi-label classification is a critical problem in many areas of data analysis such as image labeling and text categorization. In this paper we propose probabilistic multi-label model based on novel sparse feature learning. By employing an individual sparsity inducing l1-norm group l2,1-norm, the proposed has capacity capturing both label interdependencies common predictive structures. We formulate norm regularized learning non-smooth convex optimization problem, develop fast proximal gradient algorithm to solve it for optimal solution. Our empirical study demonstrates efficacy method set tasks given limited number labeled training instances.

参考文章(35)
Nadia Ghamrawi, Andrew McCallum, Collective multi-label classification Proceedings of the 14th ACM international conference on Information and knowledge management - CIKM '05. pp. 195- 200 ,(2005) , 10.1145/1099554.1099591
Andrew Y. Ng, Feature selection, L1 vs. L2 regularization, and rotational invariance Twenty-first international conference on Machine learning - ICML '04. pp. 78- ,(2004) , 10.1145/1015330.1015435
Zhigang Ma, Heng Tao Shen, Xiaofang Zhou, Zi Huang, Yi Yang, l 2,1 -norm regularized discriminative feature selection for unsupervised learning international joint conference on artificial intelligence. pp. 1589- 1594 ,(2011) , 10.5591/978-1-57735-516-8/IJCAI11-267
Shuiwang Ji, Lei Tang, Shipeng Yu, Jieping Ye, A shared-subspace learning framework for multi-label classification ACM Transactions on Knowledge Discovery From Data. ,vol. 4, pp. 8- ,(2010) , 10.1145/1754428.1754431
C. Bielza, G. Li, P. Larrañaga, Multi-dimensional classification with Bayesian networks International Journal of Approximate Reasoning. ,vol. 52, pp. 705- 727 ,(2011) , 10.1016/J.IJAR.2011.01.007
Robert E. Schapire, Yoram Singer, BoosTexter: A Boosting-based Systemfor Text Categorization Machine Learning. ,vol. 39, pp. 135- 168 ,(2000) , 10.1023/A:1007649029923
Rong Yan, Jelena Tesic, John R. Smith, Model-shared subspace boosting for multi-label classification Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '07. pp. 834- 843 ,(2007) , 10.1145/1281192.1281281
Amir Beck, Marc Teboulle, A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems Siam Journal on Imaging Sciences. ,vol. 2, pp. 183- 202 ,(2009) , 10.1137/080716542
Tibério S. Caetano, James Petterson, Submodular Multi-Label Learning neural information processing systems. ,vol. 24, pp. 1512- 1520 ,(2011)
Shai Shalev-Shwartz, Yoram Singer, Efficient Learning of Label Ranking by Soft Projections onto Polyhedra Journal of Machine Learning Research. ,vol. 7, pp. 1567- 1599 ,(2006)