Using Weighted Hybrid Discretization Method to Analyze Climate Changes

作者: Yong-Gyu Jung , Kyoung Min Kim , Young Man Kwon

DOI: 10.1007/978-3-642-35600-1_28

关键词:

摘要: Data mining is the process of posing queries to large quantities data and extracting information, often previously unknown, using mathematical, statistical machine learning techniques. However some techniques like classification clustering cannot deal with numeric attributes though most real dataset contains attributes. Continuous should be divided into a small distinct range nominal in order apply Correct discretization makes succinct contributes high performance algorithms. Meanwhile, several methods are presented applied, but it dependent on area. In this paper, we propose weighted hybrid technique based entropy contingency coefficient. Also analyze evaluation well-known such as Equal-width binning, 1R, MDLP ChiMerge.

参考文章(11)
Ron Kohavi, Mehran Sahami, Error-based and entropy-based discretization of continuous features knowledge discovery and data mining. pp. 114- 119 ,(1996)
Randy Kerber, ChiMerge: discretization of numeric attributes national conference on artificial intelligence. pp. 123- 128 ,(1992)
Vipin Kumar, Pang-Ning Tan, Michael M. Steinbach, Introduction to Data Mining ,(2013)
Mark A. Hall, Ian H. Witten, Eibe Frank, Data Mining: Practical Machine Learning Tools and Techniques ,(1999)
Keki B. Irani, Usama M. Fayyad, Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning international joint conference on artificial intelligence. ,vol. 2, pp. 1022- 1027 ,(1993)
Petra Perner, Sascha Trautzsch, Multi-interval Discretization Methods for Decision Tree Learning Lecture Notes in Computer Science. pp. 475- 482 ,(1998) , 10.1007/BFB0033269
Huan Liu, R. Setiono, Feature selection via discretization IEEE Transactions on Knowledge and Data Engineering. ,vol. 9, pp. 642- 645 ,(1997) , 10.1109/69.617056
A. Barron, J. Rissanen, Bin Yu, The minimum description length principle in coding and modeling IEEE Transactions on Information Theory. ,vol. 44, pp. 2743- 2760 ,(1998) , 10.1109/18.720554
Qiusha Zhu, Lin Lin, Mei-Ling Shyu, Shu-Ching Chen, Effective supervised discretization for classification based on correlation maximization information reuse and integration. pp. 390- 395 ,(2011) , 10.1109/IRI.2011.6009579