A comparative analysis of methods for pruning decision trees

作者: F. Esposito , D. Malerba , G. Semeraro , J. Kay

DOI: 10.1109/34.589207

关键词:

摘要: In this paper, we address the problem of retrospectively pruning decision trees induced from data, according to a top-down approach. This has received considerable attention in areas pattern recognition and machine learning, many distinct methods have been proposed literature. We make comparative study six well-known with aim understanding their theoretical foundations, computational complexity, strengths weaknesses formulation. Comments on characteristics each method are empirically supported. particular, wide experimentation performed several data sets leads us opposite conclusions predictive accuracy simplified some drawn attribute divergence differences experimental designs. Finally, prove use property reduced error obtain an objective evaluation tendency overprune/underprune observed method.

参考文章(32)
Keki B. Irani, Usama M. Fayyad, The attribute selection problem in decision tree generation national conference on artificial intelligence. pp. 104- 110 ,(1992)
Tim Niblett, Constructing Decision Trees in Noisy Domains. EWSL. pp. 67- 78 ,(1987)
Christopher J. C. H. Watkins, Combining Cross-Validation and Search. EWSL. pp. 79- 87 ,(1987)
Robert C. Holte, Bruce W. Porter, Liane E. Acker, Concept learning and the problem of small disjuncts international joint conference on artificial intelligence. pp. 813- 818 ,(1989)
I Bratko, T. Niblett, Learning decision rules in noisy domains Proceedings of Expert Systems '86, The 6Th Annual Technical Conference on Research and development in expert systems III. pp. 25- 34 ,(1987)
Ivan Bratko, Igor Kononenko, Bojan Cestnik, ASSISTANT 86: a knowledge-elicitation tool for sophisticated users EWSL'87 Proceedings of the 2nd European Conference on European Working Session on Learning. pp. 31- 45 ,(1987)
Steven L. Salzberg, Alberto Segre, Programs for Machine Learning ,(1994)
Ron Kohavi, George H. John, Automatic Parameter Selection by Minimizing Estimated Error Machine Learning Proceedings 1995. pp. 304- 312 ,(1995) , 10.1016/B978-1-55860-377-6.50045-1
Cullen Schaffer, Deconstructing the Digit Recognition Problem international conference on machine learning. pp. 394- 399 ,(1992) , 10.1016/B978-1-55860-247-2.50056-5
Richard A Olshen, Charles J Stone, Leo Breiman, Jerome H Friedman, Classification and regression trees ,(1983)