Genetic algorithm with different feature selection techniques for anomaly detectors generation

作者: Amira Sayed A Aziz , Ahmad Taher Azar , Mostafa A Salama , Aboul Ella Hassanien , Sanaa El-Ola Hanafy

DOI:

关键词: Feature selectionFeature (computer vision)Computer sciencePattern recognitionTest setData miningIntrusion detection systemSet (abstract data type)Selection (genetic algorithm)Genetic algorithmPrincipal component analysisArtificial intelligence

摘要: Intrusion detection systems have been around for quite some time, to protect from inside ad outside threats. Researchers and scientists are concerned on how enhance the intrusion performance, be able deal with real-time attacks detect them fast quick response. One way improve performance is use minimal number of features define a model in that it can used accurately discriminate normal anomalous behaviour. Many feature selection techniques out there reduce sets or extract new them. In this paper, we propose an anomaly detectors generation approach using genetic algorithm conjunction several techniques, including principle components analysis, sequential floating, correlation-based selection. A Genetic was applied deterministic crowding niching technique, generate set single run. The results show sequential-floating best results, compared others tested, especially floating forward accuracy 92.86% train 85.38% test set.

参考文章(22)
A. Nur Zincir-Heywood, Hilmi Günes Kayacik, Malcolm I. Heywood, Selecting Features for Intrusion Detection: A Feature Relevance Analysis on KDD 99. conference on privacy, security and trust. ,(2005)
Richard Jensen, Rough Set-Based Feature Selection: A Review Information Science Reference. pp. 70- 107 ,(2007) , 10.4018/978-1-59904-552-8.CH003
Hiroshi Motoda, Huan Liu, Lei Yu, Feature Selection with Selective Sampling international conference on machine learning. pp. 395- 402 ,(2002)
A. Murali, M. Rao, A Survey on Intrusion Detection Approaches international conference on information and communication technologies. pp. 233- 240 ,(2005) , 10.1109/ICICT.2005.1598592
David W. Aha, Richard L. Bankert, A Comparative Evaluation of Sequential Feature Selection Algorithms Learning from Data. pp. 199- 206 ,(1996) , 10.1007/978-1-4612-2404-4_19
Pat Langley, Selection of Relevant Features in Machine Learning national conference on artificial intelligence. pp. 1- 5 ,(1994) , 10.21236/ADA292575
Huan Liu, Lei Yu, Feature selection for high-dimensional data: a fast correlation-based filter solution international conference on machine learning. pp. 856- 863 ,(2003)
Amira Sayed A. Aziz, Ahmad Taher Azar, Aboul Ella Hassanien, Sanaa El-Ola Hanafy, Continuous Features Discretization for Anomaly Intrusion Detectors Generation Advances in Intelligent Systems and Computing. pp. 209- 221 ,(2014) , 10.1007/978-3-319-00930-8_19
Fengxi Song, Zhongwei Guo, Dayong Mei, Feature Selection Using Principal Component Analysis international conference on system science, engineering design and manufacturing informatization. ,vol. 1, pp. 27- 30 ,(2010) , 10.1109/ICSEM.2010.14
Jianping Hua, Waibhav D. Tembe, Edward R. Dougherty, Performance of feature-selection methods in the classification of high-dimension data Pattern Recognition. ,vol. 42, pp. 409- 424 ,(2009) , 10.1016/J.PATCOG.2008.08.001