A niching genetic programming-based multi-objective algorithm for hybrid data classification

作者: Marconi de Arruda Pereira , Clodoveu Augusto Davis Júnior , Eduardo Gontijo Carrano , João Antônio De Vasconcelos , None

DOI: 10.1016/J.NEUCOM.2013.12.048

关键词:

摘要: This paper introduces a multi-objective algorithm based on genetic programming to extract classification rules in databases composed of hybrid data, i.e., regular (e.g. numerical, logical, and textual) non-regular geographical) attributes. employs niche technique combined with population archive order identify the that are more suitable for classifying items amongst classes given data set. The is implemented such way user can choose function set adequate application. feature makes proposed approach virtually applicable any kind problem. Besides, problem modeled as one, which maximization accuracy minimization classifier complexity considered objective functions. A different problems, considerably sets domains, has been considered: wines, patients hepatitis, incipient faults power transformers level development cities. In this last set, some attributes geographical, they expressed points, lines or polygons. effectiveness compared three other methods, widely employed classification: Decision Tree (C4.5), Support Vector Machine (SVM) Radial Basis Function (RBF). Statistical comparisons have conducted employing one-way ANOVA Tukey's tests, provide reliable comparison methods. results show achieved better all tested instances, what suggests it considerable range applications.

参考文章(47)
Max J. Egenhofer, A model for detailed binary topological relationships Geoinformatica. ,vol. 47, pp. 261- 273 ,(2019)
Shamkant B. Navathe, Ramez Elmasri, Fundamentals of Database Systems, 5th Edition ,(2006)
Martin Ester, Alexander Frommelt, Hans-Peter Kriegel, Jöorg Sander, Spatial Data Mining: Database Primitives, Algorithms and Efficient DBMS Support Data Mining and Knowledge Discovery. ,vol. 4, pp. 193- 216 ,(2000) , 10.1023/A:1009843930701
Mark A. Hall, Ian H. Witten, Eibe Frank, Data Mining: Practical Machine Learning Tools and Techniques ,(1999)
Yi Lin, Support Vector Machines and the Bayes Rule in Classification Data Mining and Knowledge Discovery. ,vol. 6, pp. 259- 275 ,(2002) , 10.1023/A:1015469627679
Marconi de Arruda Pereira, Clodoveu Augusto Davis Júnior, João Antônio de Vasconcelos, A Niched Genetic Programming Algorithm for Classification Rules Discovery in Geographic Databases Lecture Notes in Computer Science. pp. 260- 269 ,(2010) , 10.1007/978-3-642-17298-4_27
Lizhen Wang, Lihua Zhou, Joan Lu, Jim Yip, An order-clique-based approach for mining maximal co-locations Information Sciences. ,vol. 179, pp. 3370- 3382 ,(2009) , 10.1016/J.INS.2009.05.023