Feature selection for high dimensional data: An evolutionary filter approach.

作者: Anwar Ali Yahya , Addin Osman , Abd Rahman Ramli , Adlan Balola

DOI: 10.3844/JCSSP.2011.800.820

关键词:

摘要: Problem statement: Feature selection is a task of crucial importance for the application of machine learning in various domains. In addition, recent increase data dimensionality poses a severe challenge to many existing feature approaches with respect efficiency and effectiveness. As an example, genetic algorithm effective search that lends itself directly selection; however this direct hindered by data dimensionality. Therefore adapting cope high data becomes increasingly appealing. Approach: study, we proposed adapted version genetic algorithm can be applied dimensional data. The approach is based essentially on variable length representation scheme and set modified proposed genetic operators. To assess effectiveness approach, it cues phrase selection compared its performance number ranking which are always applied task. Results Conclusion: results provide experimental evidences the effectiveness

参考文章(83)
Thomas G. Dietterich, Hussein Almuallim, Learning with many irrelevant features national conference on artificial intelligence. pp. 547- 552 ,(1991)
Kenji Kira, Larry A. Rendell, The feature selection problem: traditional methods and a new algorithm national conference on artificial intelligence. pp. 129- 134 ,(1992)
Antonio Bellacicco, Handbook of statistics 2: Classification, pattern recognition and reduction of dimensionality European Journal of Operational Research. ,vol. 15, pp. 414- 415 ,(1984) , 10.1016/0377-2217(84)90120-6
Robert Tibshirani, Trevor Hastie, Jerome H. Friedman, The Elements of Statistical Learning ,(2001)
Huming Zhu, Licheng Jiao, Jin Pan, Multi-population Genetic Algorithm for Feature Selection Lecture Notes in Computer Science. pp. 480- 487 ,(2006) , 10.1007/11881223_59
M. Hall, Correlation-based Feature Selection for Machine Learning PhD Thesis, Waikato Univer-sity. ,(1998)
William F. Punch, Richard J. Enbody, Paul D. Hovland, Min Pei, Erik D. Goodman, Lai Chia-Shun, Further Research on Feature Selection and Classification Using Genetic Algorithms international conference on genetic algorithms. pp. 557- 564 ,(1993)
Søren Brunak, Pierre Baldi, Bioinformatics: The Machine Learning Approach ,(1998)
Carlos N. Silla, Gisele L. Pappa, Alex A. Freitas, Celso A. A. Kaestner, Automatic Text Summarization with Genetic Algorithm-Based Attribute Selection ibero-american conference on artificial intelligence. pp. 305- 314 ,(2004) , 10.1007/978-3-540-30498-2_31