Survey on Feature Selection for Data Reduction

作者: R. K.Bania

DOI: 10.5120/16456-2390

关键词: Data miningData collectionData reductionRowFeature selectionComputer scienceReduction (complexity)Training setKnowledge extraction

摘要: The storage capabilities and advanced in data collection has led to an information load the size of databases increases dimensions, not only rows but also columns. Data reduction (DR) plays a vital role as prepossessing techniques area knowledge discovery from huge data. Feature selection (FS) is one well known techniques, which deals with attributes original without affecting main content. Based on training used for different applications discovery, FS technique falls into supervised, unsupervised. In this paper extensive survey supervised describing searching approach, methods application areas outline comparative study covered.

参考文章(63)
Huan Liu, Zheng Zhao, Semi-supervised Feature Selection via Spectral Analysis. siam international conference on data mining. pp. 641- 646 ,(2007)
Sanjay Chawla, Feature Selection, Association Rules Network and Theory Building Feature Selection in Data Mining. pp. 14- 21 ,(2010)
Rudy Setiono, Huan Liu, A probabilistic approach to feature selection - a filter solution international conference on machine learning. pp. 319- 327 ,(1996)
Mohamed Amir Esseghir, Effective Wrapper-Filter hybridization through GRASP Schemata. Feature Selection in Data Mining. pp. 45- 54 ,(2010)
Rudy Setiono, Huan Liu, Feature selection and classification - a probabilistic wrapper approach industrial and engineering applications of artificial intelligence and expert systems. pp. 419- 424 ,(1996)
Edda Leopold, Jörg Kindermann, Text Categorization with Support Vector Machines. How to Represent Texts in Input Space Machine Learning. ,vol. 46, pp. 423- 444 ,(2002) , 10.1023/A:1012491419635
Marko Robnik-Šikonja, Igor Kononenko, Theoretical and Empirical Analysis of ReliefF and RReliefF Machine Learning. ,vol. 53, pp. 23- 69 ,(2003) , 10.1023/A:1025667309714
Steven L. Salzberg, Alberto Segre, Programs for Machine Learning ,(1994)
Petr Somol, Jana Novovičová, Pavel Pudil, Flexible-Hybrid Sequential Floating Search in Statistical Feature Selection Lecture Notes in Computer Science. pp. 632- 639 ,(2006) , 10.1007/11815921_69
J. Pintér, A. A. Zhigli︠a︡vskiĭ, Theory of Global Random Search ,(1991)