作者: Carlos Alarcón Jaimes
DOI:
关键词:
摘要: The Naive Bayes classifier is one of the most effective classification models, due to their simplicity, resistance noise, little processing time and high predictive power. assumes a strong assumption independence between predictor variables, which generally not met. Many studies seek improve power relaxing this independence, as selection subset variables that are independent or approximately independent. In paper, method seeks optimize using C4.5 decision tree presented. This selects in data set induced for go apply these selected variables. With previous use achieved removing redundant / irrelevant dataset choose those more informative tasks, thus classifier. illustrated three sets from UCI repository, Irvin Repository Machine Learning databases at University California National Household Survey Institute Statistics Informatics Peru, ENAHO INEI , implemented WEKA program.