Feature selection of medical data sets based on RS-RELIEFF

作者: Xiao Liu , Xiaoli Wang , Qiang Su

DOI: 10.1109/ICSSSM.2015.7170275

关键词: Data miningComputer scienceNoise (video)MATLABStatistical classificationReduction (complexity)Feature (computer vision)Feature selectionAlgorithm designArtificial intelligenceData setPattern recognition

摘要: For most of data sets, there exist some redundant, irrelevant and even noise features. Usually, are plenty features in medical sets the correlation among is strong. So, feature selection gets great concern recent years. RELIEFF one effective algorithms, but cannot remove redundant RS a mathematical approach to intelligent analysis can novel RS- algorithm proposed this paper. In RS-RELIEFF, reduction applied set with firstly, then later, new integrative weight each condition will be got end. The was tested sets. experimental results show that RS-RELIEFF has better classification accuracy 71.2644% fewer selected

参考文章(10)
Marko Robnik-Šikonja, Igor Kononenko, Theoretical and Empirical Analysis of ReliefF and RReliefF Machine Learning. ,vol. 53, pp. 23- 69 ,(2003) , 10.1023/A:1025667309714
Kenji Kira, Larry A. Rendell, A Practical Approach to Feature Selection international conference on machine learning. pp. 249- 256 ,(1992) , 10.1016/B978-1-55860-247-2.50037-1
Igor Kononenko, Estimating attributes: analysis and extensions of RELIEF european conference on machine learning. pp. 171- 182 ,(1994) , 10.1007/3-540-57868-4_57
Kemal Polat, Salih Güneş, A new feature selection method on classification of medical datasets: Kernel F-score feature selection Expert Systems With Applications. ,vol. 36, pp. 10367- 10373 ,(2009) , 10.1016/J.ESWA.2009.01.041
Zdzisław Pawlak, Rough sets and intelligent data analysis Information Sciences. ,vol. 147, pp. 1- 12 ,(2002) , 10.1016/S0020-0255(02)00197-4
Yue Huang, Paul J. McCullagh, Norman D. Black, An optimization of ReliefF for classification in large datasets data and knowledge engineering. ,vol. 68, pp. 1348- 1356 ,(2009) , 10.1016/J.DATAK.2009.07.011
C. L. Blake, UCI Repository of machine learning databases www.ics.uci.edu/〜mlearn/MLRepository.html. ,(1998)
Chang-Sik Son, Yoon-Nyun Kim, Hyung-Seop Kim, Hyoung-Seob Park, Min-Soo Kim, Decision-making model for early diagnosis of congestive heart failure using rough set and decision tree approaches Journal of Biomedical Informatics. ,vol. 45, pp. 999- 1008 ,(2012) , 10.1016/J.JBI.2012.04.013
Zdzisław Pawlak, Rough sets Communications of the ACM. ,vol. 38, pp. 88- 95 ,(1995) , 10.1145/219717.219791
Summers S, Grzymala-Busse J, Budihardjo A, Woolery L, The use of machine learning program LERS-LB 2.5 in knowledge acquisition for expert system development in nursing. Computers in Nursing. ,vol. 9, pp. 227- ,(1991)