Attribute Selection with a Multi-objective Genetic Algorithm

作者: Gisele L. Pappa , Alex A. Freitas , Celso A. A. Kaestner

DOI: 10.1007/3-540-36127-8_27

关键词:

摘要: In this paper we address the problem of multi-objective attribute selection in data mining. We propose a genetic algorithm (GA) based on wrapper approach to discover best subset attributes for given classification algorithm, namely C4.5, well-known decision-tree algorithm. The two objectives be minimized are error rate and size tree produced by C4.5. proposed GA is method sense that it discovers set non-dominated solutions (attribute subsets), according concept Pareto dominance.

参考文章(14)
Hisao Ishibuchi, Tomoharu Nakashima, Multi-objective pattern and feature selection by a genetic algorithm genetic and evolutionary computation conference. pp. 1069- 1076 ,(2000)
Antonin Rozsypal, Miroslav Kubat, Using the Genetic Algorithm to Reduce the Size of a Nearest-Neighbor Classifier and to Select Relevant Attributes international conference on machine learning. pp. 449- 456 ,(2001)
M.J. Martin-Bautista, M.-A. Vila, A survey of genetic feature selection in mining issues congress on evolutionary computation. ,vol. 2, pp. 1314- 1321 ,(1999) , 10.1109/CEC.1999.782599
Alex A. Freitas, Understanding the Crucial Role of AttributeInteraction in Data Mining Artificial Intelligence Review. ,vol. 16, pp. 177- 199 ,(2001) , 10.1023/A:1011996210207
Siddhartha Bhattacharyya, Evolutionary algorithms in data mining: multi-objective performance modeling for direct marketing knowledge discovery and data mining. pp. 465- 473 ,(2000) , 10.1145/347090.347186
J. Ross Quinlan, C4.5: Programs for Machine Learning ,(1992)
Marcel Holsheimer, Arno P.J.M. Siebes, Data Mining: the search for knowledge in databases. Department of Computer Science [CS]. pp. 1- 78 ,(1994)