A study on the application of instance selection techniques in genetic fuzzy rule-based classification systems: Accuracy-complexity trade-off

Authors: Michela Fazzolari, Bruno Giglio, Rafael Alcalá, Francesco Marcelloni, Francisco Herrera

DOI: 10.1016/J.KNOSYS.2013.07.011

Keywords: Training set; Instance selection; Fuzzy number; Machine learning; Data mining; Genetic algorithm; Defuzzification; Evolutionary algorithm; Fuzzy set operations; Fuzzy rule; Fuzzy classification; Genetic fuzzy systems; Neuro-fuzzy; Artificial intelligence; Computer science

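Abstract: In the framework of genetic fuzzy systems, the computational time required by genetic algorithms for generating fuzzy rule-based models from data increases considerably with the number of instances in the training set, mainly due to the fitness evaluation. The amount of data also typically affects the complexity of the resulting model: a higher number of instances generally induces the generation of models with a higher number of rules. Since the number of rules is considered one of the factors which affect the interpretability of fuzzy rule-based models, large datasets tend to yield less interpretable models. Both of these problems can be tackled and partially solved by reducing the number of instances before applying the evolutionary process. In the literature, several instance selection methods have been proposed for selecting instances from the training set without deteriorating the accuracy of the generated models. The aim of this paper is to analyze the effectiveness of 36 training set selection methods when combined with genetic fuzzy rule-based classification systems. Using 37 datasets of different sizes, we show that some methods can help to reduce the computational time of the evolutionary process and to decrease the complexity of the fuzzy rule-based models, with a very limited decrease in their accuracy with respect to using the overall training set.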
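To make the idea of reducing the training set before learning concrete, the following is a minimal sketch of one classic instance selection technique, Hart's condensed nearest neighbor rule. It is an illustration only: the abstract does not list the 36 methods studied, so this is not claimed to be one of them, and the names `condensed_nearest_neighbor`, `nearest_label`, and `squared_distance` are hypothetical helpers introduced here.

```python
import random
from typing import List, Sequence, Tuple

# Assumption for this sketch: an instance is (feature vector, class label).
Instance = Tuple[Sequence[float], int]


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest_label(store: List[Instance], features: Sequence[float]) -> int:
    """Label of the stored instance closest to `features` (1-NN rule)."""
    nearest = min(store, key=lambda inst: squared_distance(inst[0], features))
    return nearest[1]


def condensed_nearest_neighbor(training_set: List[Instance], seed: int = 0) -> List[Instance]:
    """Return a subset of `training_set` selected by the condensed NN rule.

    Instances are added to the subset only when the current subset
    misclassifies them, so redundant instances are left out.
    """
    order = list(training_set)
    if not order:
        return []
    random.Random(seed).shuffle(order)

    store = [order[0]]  # start from a single instance
    changed = True
    while changed:  # repeat passes until a full pass adds nothing
        changed = False
        for inst in order:
            if inst in store:
                continue
            if nearest_label(store, inst[0]) != inst[1]:
                store.append(inst)
                changed = True
    return store
```

The reduced subset would then be passed to the evolutionary learner in place of the full training set, so each fitness evaluation touches fewer instances. How much accuracy is lost depends on the dataset and on the selection method, which is the trade-off the paper examines.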
