Overfitting Avoidance as Bias

作者: Cullen Schaffer

DOI: 10.1023/A:1022653209073

关键词:

摘要: Strategies for increasing predictive accuracy through selective pruning have been widely adopted by researchers in decision tree induction. It is easy to get the impression from research reports that there are statistical reasons believing these overfitting avoidance strategies do increase and that, as a community, we making progress toward developing powerful, general methods guarding against inducing trees. In fact, any strategy amounts form of bias and, such, may degrade performance instead improving it. If often proven successful empirical tests, this due, not methods, but choice test problems. As examples article illustrate, better or worse, only more less appropriate specific application domains. We not—and cannot be—making both powerful general.

参考文章(21)
Sholom M. Weiss, Nitin Indurkhya, Reduced complexity rule induction international joint conference on artificial intelligence. pp. 678- 684 ,(1991)
I Bratko, T. Niblett, Learning decision rules in noisy domains Proceedings of Expert Systems '86, The 6Th Annual Technical Conference on Research and development in expert systems III. pp. 25- 34 ,(1987)
Cullen Schaffer, Deconstructing the Digit Recognition Problem international conference on machine learning. pp. 394- 399 ,(1992) , 10.1016/B978-1-55860-247-2.50056-5
Richard A Olshen, Charles J Stone, Leo Breiman, Jerome H Friedman, Classification and regression trees ,(1983)
J. G. Carbonell, T. M. Mitchell, R. S. Michalski, Machine Learning: An Artificial Intelligence Approach Springer Publishing Company, Incorporated. ,(2013)
Bojan Cestnik, Ivan Bratko, On estimating probabilities in tree pruning Lecture Notes in Computer Science. pp. 138- 150 ,(1991) , 10.1007/BFB0017010
John Mingers, An Empirical Comparison of Pruning Methods for Decision Tree Induction Machine Learning. ,vol. 4, pp. 227- 243 ,(1989) , 10.1023/A:1022604100933
Cullen Schaffer, Sparse data and the effect of overfitting avoidance in decision tree induction national conference on artificial intelligence. pp. 147- 152 ,(1992)