Hierarchical Multilabel Classification with Optimal Path Prediction

作者: Zhengya Sun , Yangyang Zhao , Dong Cao , Hongwei Hao

DOI: 10.1007/S11063-016-9526-X

关键词:

摘要: We consider multilabel classification problems where the labels are arranged hierarchically in a tree or directed acyclic graph (DAG). In this context, it is of much interest to select well-connected subset nodes which best preserve label dependencies according learned models. Top-down bottom-up procedures for labelling hierarchy have recently been proposed, but they rely largely on pairwise interactions, thus susceptible get stuck local optima. paper, we remedy problem by directly finding small number paths that can cover desired subgraph tree/DAG. To estimate high-dimensional vector, adopt advantages partial least squares techniques perform simultaneous projections feature and space, while constructing sound linear models between them. then show optimal prediction with constraints be reasonably transformed into path structured sparsity penalties. The introduction selection further allows us leverage efficient network flow solvers polynomial time complexity. experimental results validate promising performance proposed algorithm comparison state-of-the-art algorithms both tree- DAG-structured data sets.

参考文章(23)
S. Wold, H. Martens, H. Wold, The multivariate calibration problem in chemistry solved by the PLS method Springer, Berlin, Heidelberg. pp. 286- 293 ,(1983) , 10.1007/BFB0062108
H. Wold, Path Models with Latent Variables: The NIPALS Approach Quantitative Sociology#R##N#International Perspectives on Mathematical and Statistical Modeling. pp. 307- 357 ,(1975) , 10.1016/B978-0-12-103950-9.50017-4
Rodrigo C. Barros, Ricardo Cerri, Alex A. Freitas, André C. P. L. F. de Carvalho, Probabilistic clustering for hierarchical multi-label classification of protein functions european conference on machine learning. pp. 385- 400 ,(2013) , 10.1007/978-3-642-40991-2_25
Hendrik Blockeel, Leander Schietgat, Jan Struyf, Sašo Džeroski, Amanda Clare, Decision Trees for Hierarchical Multilabel Classification: A Case Study in Functional Genomics Lecture Notes in Computer Science. ,vol. 4213, pp. 18- 29 ,(2006) , 10.1007/11871637_7
Ricardo Cerri, Rodrigo C Barros, André CPLF de Carvalho, None, Hierarchical classification of Gene Ontology-based protein functions with neural networks international joint conference on neural network. pp. 1- 8 ,(2015) , 10.1109/IJCNN.2015.7280474
Celine Vens, Jan Struyf, Leander Schietgat, Sašo Džeroski, Hendrik Blockeel, Decision trees for hierarchical multi-label classification Machine Learning. ,vol. 73, pp. 185- 214 ,(2008) , 10.1007/S10994-008-5077-3
Ricardo Cerri, Rodrigo C Barros, André CPLF De Carvalho, None, Hierarchical multi-label classification using local neural networks Journal of Computer and System Sciences. ,vol. 80, pp. 39- 56 ,(2014) , 10.1016/J.JCSS.2013.03.007
Wei Bi, James T. Kwok, Mandatory leaf node prediction in hierarchical multilabel classification. IEEE Transactions on Neural Networks. ,vol. 25, pp. 2275- 2287 ,(2014) , 10.1109/TNNLS.2014.2309437
Ricardo Cerri, Rodrigo C Barros, André CPLF de Carvalho, None, Hierarchical multi-label classification for protein function prediction: A local approach based on neural networks intelligent systems design and applications. pp. 337- 343 ,(2011) , 10.1109/ISDA.2011.6121678
Wei Bi, James T. Kwok, Hierarchical Multilabel Classification with Minimum Bayes Risk 2012 IEEE 12th International Conference on Data Mining. pp. 101- 110 ,(2012) , 10.1109/ICDM.2012.42