Analysis of Feature Selection Algorithms on Classification: A Survey

作者: S. Vanaja , K. Ramesh Kumar

DOI: 10.5120/16888-6910

关键词: Artificial neural networkGenetic algorithmAssociation rule learningLinear classifierKnowledge representation and reasoningOne-class classificationData miningFeature selectionNaive Bayes classifierMachine learningRule-based systemDecision treeRough setComputer scienceArtificial intelligenceBackpropagationSupport vector machineFuzzy setKnowledge extraction

摘要: 1. INTRODUCTIONmining is a process of knowledge discovery. The KDD an automated discovery from the original data. consists many steps like data cleaning, integration, selection, transformation, mining, pattern evaluation and representation. Among selection very much important to select relevant feature remove irrelevant attributes. Classification one datamining techniques used discover unknown class. different classification methods in mining are Bayesian (Statistical classifier), Decision tree induction, Rule based (IF THEN Rule), using Back propagation (Neural network algorithm), Support vector machine, Association Rule, k-nearest neighbor classifiers, casebased reasoning Rough set approach, Genetic algorithm, Fuzzy approach.

参考文章(31)
Grzegorz Ilczuk, Alicja Wakulicz-Deja, Selection of important attributes for medical diagnosis systems Transactions on rough sets VII. ,vol. 7, pp. 70- 84 ,(2007) , 10.1007/978-3-540-71663-1_5
Richi Nayak, Ellen Pitt, The use of various data mining and feature selection methods in the analysis of a population survey dataset AIDM '07 Proceedings of the 2nd international workshop on Integrating artificial intelligence and data mining - Volume 84. pp. 83- 93 ,(2007)
Pasi Luukka, Feature selection using fuzzy entropy measures with similarity classifier Expert Systems With Applications. ,vol. 38, pp. 4600- 4607 ,(2011) , 10.1016/J.ESWA.2010.09.133
Liangxiao Jiang, Zhihua Cai, Harry Zhang, Dianhong Wang, Not so greedy: Randomly Selected Naive Bayes Expert Systems With Applications. ,vol. 39, pp. 11022- 11028 ,(2012) , 10.1016/J.ESWA.2012.03.022
Huan Liu, Hiroshi Motoda, Lei Yu, A selective sampling approach to active feature selection Artificial Intelligence. ,vol. 159, pp. 49- 74 ,(2004) , 10.1016/J.ARTINT.2004.05.009
Cesar Iyakaremye, Pasi Luukka, David Koloseni, Feature selection using Yu's similarity measure and fuzzy entropy measures ieee international conference on fuzzy systems. pp. 1- 6 ,(2012) , 10.1109/FUZZ-IEEE.2012.6250817
Pablo Bermejo, José A Gámez, José M Puerta, None, Speeding up incremental wrapper feature subset selection with Naive Bayes classifier Knowledge Based Systems. ,vol. 55, pp. 140- 147 ,(2014) , 10.1016/J.KNOSYS.2013.10.016
Piyushkumar A. Mundra, Jagath C. Rajapakse, Gene and sample selection for cancer classification with support vectors based t-statistic Neurocomputing. ,vol. 73, pp. 2353- 2362 ,(2010) , 10.1016/J.NEUCOM.2010.02.025
Hui-Ling Chen, Bo Yang, Jie Liu, Da-You Liu, A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis Expert Systems With Applications. ,vol. 38, pp. 9014- 9022 ,(2011) , 10.1016/J.ESWA.2011.01.120
Mark Hall, A decision tree-based attribute weighting filter for naive Bayes Knowledge Based Systems. ,vol. 20, pp. 120- 126 ,(2007) , 10.1016/J.KNOSYS.2006.11.008