作者: Ali Mirza Mahmood , Mrithyumjaya Rao Kuppa
DOI: 10.1007/S00366-011-0214-1
关键词:
摘要: Classification is an important data mining task that discovers hidden knowledge from the labeled datasets. Most approaches to pruning assume all dataset are equally uniform and important, so they apply equal However, in real-world classification problems, datasets not considering rate during tends generate a decision tree with large size high misclassification rate. We approach problem by first investigating properties of each then deriving data-specific value using expert which used design techniques prune trees close perfection. An efficient algorithm dubbed EKBP proposed very general as we free use any learning base classifier. have implemented our solution experimentally verified its effectiveness forty real world benchmark UCI machine repository. In these experiments, shows it can dramatically reduce while enhancing or retaining level accuracy.