作者: Sholom M. Weiss , Nitin Indurkhya
DOI:
关键词:
摘要: We evaluate the performance of weakest-link pruning decision trees using cross-validation. This technique maps tree into a problem selection: Find best (i.e. right-sized) tree, from set ranging in size unpruned to null tree. For samples with at least 200 cases, extensive empirical evidence supports following conclusions relative (a) 10-fold cross-validation is nearly unbiased; (b) not covering highly biased; (c) consistent optimal selection for large sample sizes and (d) accuracy by largely dependent on size, irrespective population distribution.