A grammatical evolution algorithm for generation of Hierarchical Multi-Label Classification rules

作者: Ricardo Cerri , Rodrigo C Barros , André CPLF de Carvalho , Alex A Freitas , None

DOI: 10.1109/CEC.2013.6557604

关键词: Grammatical evolutionAlgorithmDirected acyclic graphArtificial neural networkMachine learningArtificial intelligenceTree (data structure)Directed graphAnt colony optimization algorithmsProbabilistic logicGenetic algorithmMulti-label classificationProtein function predictionComputer science

摘要: Hierarchical Multi-Label Classification (HMC) is a challenging task in data mining and machine learning. Each instance HMC can be classified into two or more classes simultaneously. These are structured hierarchy, the form of either tree directed acyclic graph. Therefore, an assigned to paths from hierarchical structure, resulting complex classification problem with hundreds thousands classes. Several methods have been proposed deal such problems, including several algorithms based on well-known bio-inspired techniques, as neural networks, ant colony optimization, genetic algorithms. In this work, we propose novel global method called GEHM, which makes use grammatical evolution for generating rules. approach, algorithm evolves antecedents rules, order assign instances dataset probabilistic class vector. Our compared protein function prediction datasets. The empirical analysis conducted work shows that GEHM outperforms statistical significance, suggests promising alternative multi-label biological data.

参考文章(32)
Svetlana Kiritchenko, Fazel Famili, Stan Matwin, Functional Annotation of Genes Using Hierarchical Text Categorization ,(2005)
Fernando E. B. Otero, Alex A. Freitas, Colin G. Johnson, A Hierarchical Classification Ant Colony Algorithm for Predicting Gene Ontology Terms evolutionary computation machine learning and data mining in bioinformatics. pp. 68- 79 ,(2009) , 10.1007/978-3-642-01184-9_7
M. Dorigo, Optimization, Learning and Natural Algorithms Ph.D. Thesis, Politecnico di Milano, Italy. ,(1992)
Svetlana Kiritchenko, Stan Matwin, Richard Nock, A. Fazel Famili, Learning and Evaluation in the Presence of Class Hierarchies: Application to Text Categorization Advances in Artificial Intelligence. pp. 395- 406 ,(2006) , 10.1007/11766247_34
Hendrik Blockeel, Leander Schietgat, Jan Struyf, Sašo Džeroski, Amanda Clare, Decision Trees for Hierarchical Multilabel Classification: A Case Study in Functional Genomics Lecture Notes in Computer Science. ,vol. 4213, pp. 18- 29 ,(2006) , 10.1007/11871637_7
Giorgio Valentini, True Path Rule Hierarchical Ensembles multiple classifier systems. pp. 232- 241 ,(2009) , 10.1007/978-3-642-02326-2_24
Janez Demšar, Statistical Comparisons of Classifiers over Multiple Data Sets Journal of Machine Learning Research. ,vol. 7, pp. 1- 30 ,(2006)
Roberto T. Alves, Myriam R. Delgado, Alex A. Freitas, Multi-label Hierarchical Classification of Protein Functions with Artificial Immune Systems Advances in Bioinformatics and Computational Biology. pp. 1- 12 ,(2008) , 10.1007/978-3-540-85557-6_1
Nir Friedman, Dan Geiger, Moises Goldszmidt, Bayesian Network Classifiers Machine Learning. ,vol. 29, pp. 131- 163 ,(1997) , 10.1023/A:1007465528199
Ricardo Cerri, Rodrigo C Barros, Andre CPLF de Carvalho, None, A genetic algorithm for Hierarchical Multi-Label Classification acm symposium on applied computing. pp. 250- 255 ,(2012) , 10.1145/2245276.2245325