Evolving accurate and comprehensible classification rules

作者: Cecilia Sonstrod , Ulf Johansson , Rikard Konig

DOI: 10.1109/CEC.2011.5949784

关键词:

摘要: In this paper, Genetic Programming is used to evolve ordered rule sets (also called decision lists) for a number of benchmark classification problems, with evaluation both predictive performance and comprehensibility. The main purpose compare approach the standard list algorithm JRip also evaluate use different length penalties fitness functions evolving type model. results, using 25 data from UCI repository, show that genetic lists accuracy-based outperform regarding accuracy. Indeed, best setup was significantly better than JRip. JRip, however, held slight advantage over these models when evaluating AUC. Furthermore, all setups produced were more compact models, thus readily comprehensible. effect very clear; in essence, performed on criterion function, worsening other criteria. Brier score provided middle ground, acceptable accuracy conclusion programming solves task well, but have immediate effects results. Thus, parameters can be control trade-off between aspects

参考文章(23)
Lars Niklasson, Rikard König, Ulf Johansson, Using Genetic Programming to Increase Rule Quality the florida ai research society. pp. 288- 293 ,(2008)
Foster Provost, R Fawcett, T, Kohavi, The Case against Accuracy Estimation for Comparing Induction Algorithms international conference on machine learning. pp. 445- 453 ,(1998)
Vipin Kumar, Pang-Ning Tan, Michael M. Steinbach, Introduction to Data Mining ,(2013)
Janez Demšar, Statistical Comparisons of Classifiers over Multiple Data Sets Journal of Machine Learning Research. ,vol. 7, pp. 1- 30 ,(2006)
Cecilia Sönströd, Ulf Johansson, None, Towards a Unified View on Concept Description international conference on data mining. pp. 59- 65 ,(2007)
Johannes Fürnkranz, Peter Flach, An analysis of stopping and filtering criteria for rule learning european conference on machine learning. pp. 123- 133 ,(2004) , 10.1007/978-3-540-30115-8_14
William W. Cohen, Fast Effective Rule Induction Machine Learning Proceedings 1995. pp. 115- 123 ,(1995) , 10.1016/B978-1-55860-377-6.50023-2
Milton Friedman, The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance Journal of the American Statistical Association. ,vol. 32, pp. 675- 701 ,(1937) , 10.2307/2279372
Emiliano Carreno, Guillermo Leguizamon, Neal Wagner, Evolution of classification rules for comprehensible knowledge discovery congress on evolutionary computation. pp. 1261- 1268 ,(2007) , 10.1109/CEC.2007.4424615