Analyzing the presence of noise in multi-class problems: alleviating its influence with the One-vs-One decomposition

作者: José A. Sáez , Mikel Galar , Julián Luengo , Francisco Herrera

DOI: 10.1007/S10115-012-0570-1

关键词:

摘要: The presence of noise in data is a common problem that produces several negative consequences classification problems. In multi-class problems, these are aggravated terms accuracy, building time, and complexity the classifiers. cases, an interesting approach to reduce effect decompose into binary subproblems, reducing and, consequently, dividing effects caused by each subproblems. This paper analyzes usage decomposition strategies, more specifically One-vs-One scheme, deal with noisy datasets. order investigate whether able or not, large number datasets created introducing different levels types noise, as suggested literature. Several well-known algorithms, without decomposition, trained on them check when advantageous. results obtained show methods using strategy lead better performances robust classifiers dealing data, especially most disruptive schemes.

参考文章(61)
Matjaz Kukar, Igor Kononenko, Machine Learning and Data Mining: Introduction to Principles and Algorithms Horwood Publishing Limited. ,(2007)
Nada Lavrac, Ciril Groselj, Dragan Gamberger, Experiments with Noise Filtering in a Medical Domain international conference on machine learning. pp. 143- 151 ,(1999)
Xindong Wu, Xingquan Zhu, Ying Yang, Error detection and impact-sensitive instance ranking in noisy datasets national conference on artificial intelligence. pp. 378- 383 ,(2004)
Jeanny Hérault, Françoise Fogelman-Soulié, Neurocomputing : algorithms, architectures and applications Springer-Verlag. ,(1990)
David G. Stork, Richard O. Duda, Peter E. Hart, Pattern Classification (2nd ed.) ,(1999)
Ting-Fan Wu, Chih-Jen Lin, Ruby Weng, None, Probability Estimates for Multi-class Classification by Pairwise Coupling Journal of Machine Learning Research. ,vol. 5, pp. 975- 1005 ,(2004) , 10.5555/1005332.1016791
Miguel Moreira, Eddy Mayoraz, On the Decomposition of Polychotomies into Dichotomies international conference on machine learning. pp. 219- 226 ,(1997)
Choh-Man Teng, Correcting Noisy Data international conference on machine learning. pp. 239- 248 ,(1999)
Janez Demšar, Statistical Comparisons of Classifiers over Multiple Data Sets Journal of Machine Learning Research. ,vol. 7, pp. 1- 30 ,(2006)