Optimizing Decision Tree Through Attributes Generation Using Genetic Programming for Clinical Data

作者: Narander Kumar , Sabita Khatri , ,

DOI: 10.17485/IJST/2017/V10I22/112117

关键词:

摘要: Objective: To intend towards increasing classification efficiency of J48 classifier by introducing attribute set on the basis applied genetic programming. The constructed not only enhances data capabilities but also increased space for algorithm giving more accurate results. Methods/Analysis: datasets related to heart and liver disease were selected from UCI machine learning repositories. experiment has been conducted with help WEKA tool, which is an open source tool mining. Finding: After experimentation it found that better accuracy reduced error rate when after inclusion newly generated attributes adding induced programming, significant boost can be seen in 74% 83% 68% 72% respectively. Improvement: We obtained results compared existing literature chosen clinical datasets.

参考文章(13)
Santosh Kumar, G. Sahoo, Classification of Heart Disease Using Naïve Bayes and Genetic Algorithm Computational Intelligence in Data Mining - Volume 2. pp. 269- 282 ,(2015) , 10.1007/978-81-322-2208-8_25
Fernando E. B. Otero, Monique M. S. Silva, Alex A. Freitas, Julio C. Nievola, Genetic programming for attribute construction in data mining genetic and evolutionary computation conference. pp. 1270- 1270 ,(2002) , 10.1007/3-540-36599-0_36
Gisele L. Pappa, Alex A. Freitas, Celso A. A. Kaestner, Attribute Selection with a Multi-objective Genetic Algorithm brazilian symposium on artificial intelligence. pp. 280- 290 ,(2002) , 10.1007/3-540-36127-8_27
Muhammad Waqar Aslam, Asoke Kumar Nandi, Detection of diabetes using genetic programming european signal processing conference. pp. 1184- 1188 ,(2010)
T. Santhanam, E. P. Ephzibah, Heart Disease Prediction Using Hybrid Genetic Fuzzy Model Indian journal of science and technology. ,vol. 8, pp. 797- 803 ,(2015) , 10.17485/IJST/2015/V8I9/52930
Sreerama K. Murthy, Automatic Construction of Decision Trees from Data: A Multi-Disciplinary Survey Data Mining and Knowledge Discovery. ,vol. 2, pp. 345- 389 ,(1998) , 10.1023/A:1009744630224
Usama Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth, The KDD process for extracting useful knowledge from volumes of data Communications of the ACM. ,vol. 39, pp. 27- 34 ,(1996) , 10.1145/240455.240464
D.P. Muni, N.R. Pal, J. Das, A novel approach to design classifiers using genetic programming IEEE Transactions on Evolutionary Computation. ,vol. 8, pp. 183- 196 ,(2004) , 10.1109/TEVC.2004.825567
Mark Hall, Eibe Frank, Geoffrey Holmes, Bernhard Pfahringer, Peter Reutemann, Ian H. Witten, The WEKA data mining software ACM SIGKDD Explorations Newsletter. ,vol. 11, pp. 10- 18 ,(2009) , 10.1145/1656274.1656278
Ricardo Aler, César Estébanez, José María Valls, A Method Based on Genetic Programming for Improving the Quality of Datasets in Classification Problems International Journal of Computer Science & Applications. ,vol. 4, pp. 69- 80 ,(2007)