Mutual Information Estimation for Filter Based Feature Selection Using Particle Swarm Optimization

作者: Hoai Bach Nguyen , Bing Xue , Peter Andreae

DOI: 10.1007/978-3-319-31204-0_46

关键词: Multi-swarm optimizationComputer scienceOverfittingFeature selectionk-nearest neighbors algorithmParticle swarm optimizationArtificial intelligenceMutual informationPattern recognitionMetaheuristicFitness function

摘要: Feature selection is a pre-processing step in classification, which selects small set of important features to improve the classification performance and efficiency. Mutual information very popular feature because it able detect non-linear relationship between features. However existing mutual approaches only consider two-way interaction In addition, most methods, calculated by counting approach, may lead an inaccurate results. This paper proposes filter algorithm based on particle swarm optimization (PSO) named PSOMIE, employs novel fitness function using nearest neighbor estimation (NNE) measure quality set. PSOMIE compared with all two traditional approaches. The experiment results show that successfully guides PSO search for number while maintaining or improving over methods. provides strong consistency training test results, be used avoid overfitting problem.

参考文章(33)
Bing Xue, Mengjie Zhang, Will N. Browne, Xin Yao, A Survey on Evolutionary Computation Approaches to Feature Selection IEEE Transactions on Evolutionary Computation. ,vol. 20, pp. 606- 626 ,(2016) , 10.1109/TEVC.2015.2504420
Kevin Bache, Moshe Lichman, UCI Machine Learning Repository University of California, School of Information and Computer Science. ,(2007)
Rachel Hunt, Kourosh Neshatian, Mengjie Zhang, A genetic programming approach to hyper-heuristic feature selection simulated evolution and learning. pp. 320- 330 ,(2012) , 10.1007/978-3-642-34859-4_32
Gauthier Doquire, Michel Verleysen, A Performance Evaluation of Mutual Information Estimators for Multivariate Feature Selection Springer, Berlin, Heidelberg. pp. 51- 63 ,(2013) , 10.1007/978-3-642-36530-0_5
Igor Kononenko, On biases in estimating multi-valued attributes international joint conference on artificial intelligence. pp. 1034- 1040 ,(1995)
Urvesh Bhowan, D. J. McCloskey, Genetic Programming for Feature Selection and Question-Answer Ranking in IBM Watson Lecture Notes in Computer Science. pp. 153- 166 ,(2015) , 10.1007/978-3-319-16501-1_13
Manoranjan Dash, Huan Liu, Hiroshi Motoda, Consistency Based Feature Selection pacific asia conference on knowledge discovery and data mining. pp. 98- 109 ,(2000) , 10.1007/3-540-45571-X_12
Tony Butler-Yeoman, Bing Xue, Mengjie Zhang, Particle swarm optimisation for feature selection: A hybrid filter-wrapper approach congress on evolutionary computation. pp. 2428- 2435 ,(2015) , 10.1109/CEC.2015.7257186
Bing Xue, Mengjie Zhang, Will N. Browne, Particle swarm optimisation for feature selection in classification soft computing. ,vol. 18, pp. 261- 276 ,(2014) , 10.1016/J.ASOC.2013.09.018
Mark Andrew Hall, Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning international conference on machine learning. pp. 359- 366 ,(2000)