作者: Ricardo Vilalta , Irina Rish
DOI: 10.1007/978-3-540-39857-8_40
关键词: Product (mathematics) 、 Artificial intelligence 、 Discriminant function analysis 、 Class (set theory) 、 Computer science 、 Decomposition (computer science) 、 Cluster analysis 、 Naive Bayes classifier 、 Machine learning 、 Conditional probability 、 Pattern recognition
摘要: We propose a method to improve the probability estimates made by Naive Bayes avoid effects of poor class conditional probabilities based on product distributions when each spreads into multiple regions. Our approach is applying clustering algorithm subset examples that belong same class, and consider cluster as its own. Experiments 26 real-world datasets show significant improvement in performance decomposition process applied, particularly mean number clusters per large.