An Improved CART Decision Tree for Datasets with Irrelevant Feature

作者: Ali Mirza Mahmood , Mohammad Imran , Naganjaneyulu Satuluri , Mrithyumjaya Rao Kuppa , Vemulakonda Rajesh

DOI: 10.1007/978-3-642-27172-4_64

关键词:

摘要: Data mining tasks results are usually improved by reducing the dimensionality of data. This improvement however is achieved harder in case that data size moderate or huge. Although numerous algorithms for accuracy have been proposed, all assume inducing a compact and highly generalized model difficult. In order to address above said issue, we introduce Randomized Gini Index (RGI), novel heuristic function reduction, particularly applicable large scale databases. Apart from removing irrelevant attributes, our algorithm capable minimizing level noise greater extend which very attractive feature problems. We extensively evaluate its performance through experiments on both artificial real world datasets. The outcome study shows suitability viability approach knowledge discovery

参考文章(18)
Huan Liu, Hiroshi Motoda, None, Computational Methods of Feature Selection Chapman and Hall/CRC. ,(2007) , 10.1201/9781584888796
Mark A. Hall, Ian H. Witten, Eibe Frank, Data Mining: Practical Machine Learning Tools and Techniques ,(1999)
Leyli Mohammad Khanli, Farnaz Mahan, Ayaz Isazadeh, Active rule learning using decision tree for resource management in Grid computing Future Generation Computer Systems. ,vol. 27, pp. 703- 710 ,(2011) , 10.1016/J.FUTURE.2010.12.016
Barak Aviad, Gelbard Roy, Classification by clustering decision tree-like classifier based on adjusted clusters Expert Systems With Applications. ,vol. 38, pp. 8220- 8228 ,(2011) , 10.1016/J.ESWA.2011.01.001
Ali Mirza Mahmood, Mrithyumjaya Rao Kuppa, A novel pruning approach using expert knowledge for data-specific pruning Engineering with Computers. ,vol. 28, pp. 21- 30 ,(2012) , 10.1007/S00366-011-0214-1
X. Tan, B. Bhanu, Y. Lin, Fingerprint classification based on learned features systems man and cybernetics. ,vol. 35, pp. 287- 300 ,(2005) , 10.1109/TSMCC.2005.848167
Duen-Ren Liu, Chin-Hui Lai, Wang-Jung Lee, A hybrid of sequential rules and collaborative filtering for product recommendation Information Sciences. ,vol. 179, pp. 3505- 3519 ,(2009) , 10.1016/J.INS.2009.06.004
Wang Yi, The Cascade Decision-Tree Improvement Algorithm Based on Unbalanced Data Set communications and mobile computing. ,vol. 1, pp. 284- 288 ,(2010) , 10.1109/CMC.2010.171
Pei-Chann Chang, Chin-Yuan Fan, Jun-Lin Lin, Trend discovery in financial time series data using a case based fuzzy decision tree Expert Systems With Applications. ,vol. 38, pp. 6070- 6080 ,(2011) , 10.1016/J.ESWA.2010.11.006
Smith Tsang, Ben Kao, Kevin Y. Yip, Wai-Shing Ho, Sau Dan Lee, Decision Trees for Uncertain Data IEEE Transactions on Knowledge and Data Engineering. ,vol. 23, pp. 64- 78 ,(2011) , 10.1109/TKDE.2009.175