Dynamic Discretization of Continuous Attributes

作者: João Gama , Luis Torgo , Carlos Soares

DOI: 10.1007/3-540-49795-1_14

关键词: SortingMachine learningArtificial intelligenceDiscretizationBenchmark (computing)Bayesian probabilityFeature selectionComputer scienceNaive Bayes classifierDiscretization of continuous featuresDecision tree

摘要: Discretization of continuous attributes is an important task for certain types machine learning algorithms. Bayesian approaches, instance, require assumptions about data distributions. Decision Trees, on the other hand, sorting operations to deal with attributes, which largely increase times. This paper presents a new method discretization, whose main characteristic that it takes into account interdependencies between attributes. Detecting can be seen as discovering redundant means our performs attribute selection side effect discretization. Empirical evaluation five benchmark datasets from UCI repository, using C4.5 and naive Bayes, shows consistent reduction features without loss generalization accuracy.

参考文章(11)
Henry D. Shapiro, Bernard M. E. Moret, Algorithms from P to NP (vol. 1): design and efficiency Benjamin-Cummings Publishing Co., Inc.. ,(1991)
Ron Kohavi, Mehran Sahami, Error-based and entropy-based discretization of continuous features knowledge discovery and data mining. pp. 114- 119 ,(1996)
M. Richeldi, M. Rossotto, Class-Driven Statistical Discretization of Continuous Attributes (Extended Abstract) european conference on machine learning. pp. 335- 338 ,(1995) , 10.1007/3-540-59286-5_81
Michael J. Pazzani, Pedro M. Domingos, Beyond Independence: Conditions for the Optimality of the Simple Bayesian Classifier. international conference on machine learning. pp. 105- 112 ,(1996)
Jason Catlett, Mega induction: a Test Flight Machine Learning Proceedings 1991. pp. 596- 599 ,(1991) , 10.1016/B978-1-55860-200-7.50121-5
Randy Kerber, ChiMerge: discretization of numeric attributes national conference on artificial intelligence. pp. 123- 128 ,(1992)
Keki B. Irani, Usama M. Fayyad, Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning international joint conference on artificial intelligence. ,vol. 2, pp. 1022- 1027 ,(1993)
J. Catlett, On changing continuous attributes into ordered discrete attributes Lecture Notes in Computer Science. pp. 164- 178 ,(1991) , 10.1007/BFB0017012
Luís Torgo, João Gama, Search-Based Class Discretization european conference on machine learning. pp. 266- 273 ,(1997) , 10.1007/3-540-62858-4_91
James Dougherty, Ron Kohavi, Mehran Sahami, Supervised and Unsupervised Discretization of Continuous Features Machine Learning Proceedings 1995. pp. 194- 202 ,(1995) , 10.1016/B978-1-55860-377-6.50032-3