A preliminary study on the reuse of subtrees within decision trees in a genetic programming context for data classification

作者: Emmanuel Dufourq , Nelishia Pillay

DOI: 10.1109/WICT.2013.7113150

关键词:

摘要: Genetic programming (GP) has been successful in creating models for data classification which obtain high accuracies. In a context functions is common practice as this serves way to isolate part of code can be reused. The encapsulation genetic operator capable promoting modularization the sense that encapsulate subtrees reused by GP trees during execution algorithm. Models created problems tend large and certain complexity, thus rendering need modular acquisition methods promote reuse existing order solve problems. effect when solving not previously investigated. Two approaches were proposed, first incorporated with no limitations on how use encapsulated subtrees. second approach made maintained list two proposed tested eight sets results show improved training accuracy nearly every set.

参考文章(10)
Mark A. Hall, Ian H. Witten, Eibe Frank, Data Mining: Practical Machine Learning Tools and Techniques ,(1999)
Witold Pedrycz, Lukasz Andrzej Kurgan, Krzysztof J. Cios, Roman W. Swiniarski, Data Mining: A Knowledge Discovery Approach ,(2007)
Max Bramer, Principles of Data Mining ,(2007)
T Shukla, S Singh, K Naik, Allocation of optimal distributed generation using GA for minimum system losses in radial distribution networks International journal of engineering science and technology. ,vol. 2, pp. 94- 106 ,(2010) , 10.4314/IJEST.V2I3.59178
Salvador Garcia, J. Luengo, José Antonio Sáez, Victoria López, F. Herrera, A Survey of Discretization Techniques: Taxonomy and Empirical Analysis in Supervised Learning IEEE Transactions on Knowledge and Data Engineering. ,vol. 25, pp. 734- 750 ,(2013) , 10.1109/TKDE.2012.35
Huan Liu, Farhad Hussain, Chew Lim Tan, Manoranjan Dash, Discretization: An Enabling Technique Data Mining and Knowledge Discovery. ,vol. 6, pp. 393- 423 ,(2002) , 10.1023/A:1016304305535
P.G. Espejo, S. Ventura, F. Herrera, A Survey on the Application of Genetic Programming to Classification systems man and cybernetics. ,vol. 40, pp. 121- 144 ,(2010) , 10.1109/TSMCC.2009.2033566
Kevin Bache, Moshe Lichman, UCI Machine Learning Repository University of California, School of Information and Computer Science. ,(2007)