作者: Ricardo Cerri , Rodrigo C Barros , André CPLF de Carvalho , Alex A Freitas , None
关键词: Grammatical evolution 、 Algorithm 、 Directed acyclic graph 、 Artificial neural network 、 Machine learning 、 Artificial intelligence 、 Tree (data structure) 、 Directed graph 、 Ant colony optimization algorithms 、 Probabilistic logic 、 Genetic algorithm 、 Multi-label classification 、 Protein function prediction 、 Computer science
摘要: Hierarchical Multi-Label Classification (HMC) is a challenging task in data mining and machine learning. Each instance HMC can be classified into two or more classes simultaneously. These are structured hierarchy, the form of either tree directed acyclic graph. Therefore, an assigned to paths from hierarchical structure, resulting complex classification problem with hundreds thousands classes. Several methods have been proposed deal such problems, including several algorithms based on well-known bio-inspired techniques, as neural networks, ant colony optimization, genetic algorithms. In this work, we propose novel global method called GEHM, which makes use grammatical evolution for generating rules. approach, algorithm evolves antecedents rules, order assign instances dataset probabilistic class vector. Our compared protein function prediction datasets. The empirical analysis conducted work shows that GEHM outperforms statistical significance, suggests promising alternative multi-label biological data.