Input space reduction for rule based classification

作者: Kevin S. Tickle , Mohammed M. Mazid , A. B. M. Shawkat Ali

DOI: 10.5555/1852548.1852549

关键词:

摘要: Rule based classification is one of the most popular way in data mining. There are number algorithms for rule classification. C4.5 and Partial Decision Tree (PART) very among them both have many empirical features such as continuous categorization, missing value handling, etc. However cases these takes more processing time provides less accuracy rate correctly classified instances. One main reasons high dimensionality databases. A large dataset might contain hundreds attributes with huge We need to choose related obtain higher accuracy. It also a difficult task proper algorithm perform efficient perfect With our proposed method, we select relevant from by reducing input space simultaneously improve performance two algorithms. The improved measured on better computational complexity. measure Entropy Information Theory identify central attribute dataset. Then apply correlation coefficient namely, Pearson's, Spearman Kendall utilizing same conducted comparative study using three measures best method. picked datasets well known repository UCI (University California Irvine) database. used box plot compare experimental results. Our method has showed individual experiment.

参考文章(35)
Mohammed E. El-Telbany, Mining the classification rules: the egyptian rice diseases as case study international conference on telecommunications. pp. 38- ,(2005)
Saleh A. Wasimi, A. B. M. Shawkat Ali, Data Mining Methods And Techniques ,(2009)
Ramakrishnan Srikant, Rakesh Agrawal, Fast algorithms for mining association rules very large data bases. pp. 580- 592 ,(1998)
Ramakrishnan Srikant, Rakesh Agrawal, Fast Algorithms for Mining Association Rules in Large Databases very large data bases. pp. 487- 499 ,(1994)
Berndt Müller, Joachim Reinhardt, Neural Networks: An Introduction ,(2014)
Ian H. Witten, Eibe Frank, Generating Accurate Rule Sets Without Global Optimization international conference on machine learning. pp. 144- 151 ,(1998)
J. R. Quinlan, R. M. Cameron-Jones, FOIL: A Midterm Report european conference on machine learning. pp. 3- 20 ,(1993) , 10.1007/3-540-56602-3_124
Jiawei Han, Xiaoxin Yin, CPAR: Classification based on Predictive Association Rules. siam international conference on data mining. pp. 331- 335 ,(2003)
William W. Cohen, Fast Effective Rule Induction Machine Learning Proceedings 1995. pp. 115- 123 ,(1995) , 10.1016/B978-1-55860-377-6.50023-2
ShengYi Jiang, Wen Yu, A Combination Classification Algorithm Based on Outlier Detection and C4.5 advanced data mining and applications. pp. 504- 511 ,(2009) , 10.1007/978-3-642-03348-3_50