A decomposition of classes via clustering to explain and improve Naive Bayes

作者: Ricardo Vilalta , Irina Rish

DOI: 10.1007/978-3-540-39857-8_40

关键词: Product (mathematics)Artificial intelligenceDiscriminant function analysisClass (set theory)Computer scienceDecomposition (computer science)Cluster analysisNaive Bayes classifierMachine learningConditional probabilityPattern recognition

摘要: We propose a method to improve the probability estimates made by Naive Bayes avoid effects of poor class conditional probabilities based on product distributions when each spreads into multiple regions. Our approach is applying clustering algorithm subset examples that belong same class, and consider cluster as its own. Experiments 26 real-world datasets show significant improvement in performance decomposition process applied, particularly mean number clusters per large.

参考文章(17)
Robert C. Holte, Bruce W. Porter, Liane E. Acker, Concept learning and the problem of small disjuncts international joint conference on artificial intelligence. pp. 813- 818 ,(1989)
Gerald W. Kimble, Information and Computer Science Information and Computer Science. pp. 180- 180 ,(1975)
David G. Stork, Richard O. Duda, Peter E. Hart, Pattern Classification (2nd ed.) ,(1999)
Ashutosh Garg, Dan Roth, Understanding Probabilistic Classifiers european conference on machine learning. pp. 179- 191 ,(2001) , 10.1007/3-540-44795-4_16
Ron Kohavi, Scaling up the accuracy of Naive-Bayes classifiers: a decision-tree hybrid knowledge discovery and data mining. pp. 202- 207 ,(1996)
Huajie Zhang, Charles X. Ling, Geometric Properties of Naive Bayes in Nominal Domains european conference on machine learning. pp. 588- 599 ,(2001) , 10.1007/3-540-44795-4_50
Ron Kohavi, Dan Sommerfield, Barry G. Becker, Improving simple Bayes ECML. ,(1997)
Bianca Zadrozny, Charles Elkan, Obtaining calibrated probability estimates from decision trees and naive Bayesian classifiers international conference on machine learning. pp. 609- 616 ,(2001)
Geoffrey I. Webb, Michael J. Pazzani, Adjusted Probability Naive Bayesian Induction australian joint conference on artificial intelligence. pp. 285- 295 ,(1998) , 10.1007/BFB0095060
Nir Friedman, Dan Geiger, Moises Goldszmidt, Bayesian Network Classifiers Machine Learning. ,vol. 29, pp. 131- 163 ,(1997) , 10.1023/A:1007465528199