Hierarchical multi-label classification for protein function prediction: A local approach based on neural networks

作者: Ricardo Cerri , Rodrigo C Barros , André CPLF de Carvalho , None

DOI: 10.1109/ISDA.2011.6121678

关键词:

摘要: In Hierarchical Multi-Label Classification problems, each instance can be classified into two or more classes simultaneously, differently from conventional classification. Additionally, the are structured in a hierarchy, form of either tree directed acyclic graph. Hence, an assigned to paths hierarchical structure, resulting complex classification problem with possibly hundreds classes. Many methods have been proposed deal such some them employing single classifier all simultaneously (global methods), and others many classifiers decompose original set subproblems (local methods). this work, we propose novel local method named HMC-LMLP, which uses one Multi-Layer Perceptron per level. The predictions level used as inputs network responsible for next We make use distinct algorithms: Back-propagation Resilient Back-propagation. addition, error measure specially tailored multi-label problems training networks. Our is compared state-of-the-art algorithms, protein function prediction datasets. experimental results show that our approach presents competitive predictive accuracy, suggesting artificial neural networks constitute promising alternative biological data.

参考文章(31)
Svetlana Kiritchenko, Fazel Famili, Stan Matwin, Functional Annotation of Genes Using Hierarchical Text Categorization ,(2005)
Fernando E. B. Otero, Alex A. Freitas, Colin G. Johnson, A Hierarchical Classification Ant Colony Algorithm for Predicting Gene Ontology Terms evolutionary computation machine learning and data mining in bioinformatics. pp. 68- 79 ,(2009) , 10.1007/978-3-642-01184-9_7
M. J. D. Powell, Radial basis functions for multivariable interpolation: a review Algorithms for approximation. pp. 143- 167 ,(1987)
Luc De Raedt, Hendrik Blockeel, Jan Ramon, Top-Down Induction of Clustering Trees international conference on machine learning. pp. 55- 63 ,(1998)
Hendrik Blockeel, Leander Schietgat, Jan Struyf, Sašo Džeroski, Amanda Clare, Decision Trees for Hierarchical Multilabel Classification: A Case Study in Functional Genomics Lecture Notes in Computer Science. ,vol. 4213, pp. 18- 29 ,(2006) , 10.1007/11871637_7
Giorgio Valentini, True Path Rule Hierarchical Ensembles multiple classifier systems. pp. 232- 241 ,(2009) , 10.1007/978-3-642-02326-2_24
Janez Demšar, Statistical Comparisons of Classifiers over Multiple Data Sets Journal of Machine Learning Research. ,vol. 7, pp. 1- 30 ,(2006)
Jan Struyf, Sašo Džeroski, Hendrik Blockeel, Amanda Clare, Hierarchical Multi-classification with Predictive Clustering Trees in Functional Genomics Progress in Artificial Intelligence. ,vol. 3808, pp. 272- 283 ,(2005) , 10.1007/11595014_27
Kevin Thompson, Pat Langley, and Wayne Iba, An analysis of Bayesian classifiers national conference on artificial intelligence. pp. 223- 228 ,(1992)
Ricardo Cerri, André Carlos PLF de Carvalho, Alex A Freitas, None, Adapting non-hierarchical multilabel classification methods for hierarchical multilabel classification intelligent data analysis. ,vol. 15, pp. 861- 887 ,(2011) , 10.3233/IDA-2011-0500