Cost-sensitive feature selection via the ℓ2,1-norm

作者: Hong Zhao , Shenglong Yu

DOI: 10.1016/J.IJAR.2018.10.017

关键词: OutlierTotal costComputer scienceNorm (mathematics)Feature vectorCost sensitiveFeature selectionNorm minimizationData mining

摘要: Abstract An essential step in data mining and machine learning is selecting a useful feature subset from the high-dimensional space. Many existing selection algorithms only consider precision, but do not error types test cost. In this paper, we use l 2 , 1 -norm to propose cost-sensitive embedded algorithm that minimizes total cost rather than maximizing accuracy. The with joint minimization of loss function misclassification costs. based costs robust outliers. We also add an orthogonal constraint term guarantee each selected independent. proposed simultaneously takes into account both Finally, iterative updating provided using objective makes more efficient. realistic algorithms. Extensive experimental results on publicly available datasets demonstrate effective, can select low-cost achieve better performance other real-world applications.

参考文章(51)
Mingjie Qian, Chengxiang Zhai, Robust unsupervised feature selection international joint conference on artificial intelligence. pp. 1621- 1627 ,(2013)
Jinhai Li, Yue Ren, Changlin Mei, Yuhua Qian, Xibei Yang, None, A comparative study of multigranulation rough sets and concept lattices via rule acquisition Knowledge Based Systems. ,vol. 91, pp. 152- 164 ,(2016) , 10.1016/J.KNOSYS.2015.07.024
Rudy Setiono, Huan Liu, Feature selection and classification - a probabilistic wrapper approach industrial and engineering applications of artificial intelligence and expert systems. pp. 419- 424 ,(1996)
Quan Zou, Jiancang Zeng, Liujuan Cao, Rongrong Ji, A novel features ranking metric with application to scalable visual and bioinformatics data classification Neurocomputing. ,vol. 173, pp. 346- 354 ,(2016) , 10.1016/J.NEUCOM.2014.12.123
Janez Demšar, Statistical Comparisons of Classifiers over Multiple Data Sets Journal of Machine Learning Research. ,vol. 7, pp. 1- 30 ,(2006)
Daoqiang Zhang, Linsong Miao, Mingxia Liu, Cost-sensitive feature selection with application in software defect prediction international conference on pattern recognition. pp. 967- 970 ,(2012)
Huan Liu, Lei Wang, Zheng Zhao, Efficient spectral feature selection with minimum redundancy national conference on artificial intelligence. pp. 673- 678 ,(2010)
Igor Kononenko, Estimating attributes: analysis and extensions of RELIEF european conference on machine learning. pp. 171- 182 ,(1994) , 10.1007/3-540-57868-4_57
Dongyoon Han, Junmo Kim, Unsupervised Simultaneous Orthogonal basis Clustering Feature Selection computer vision and pattern recognition. pp. 5016- 5023 ,(2015) , 10.1109/CVPR.2015.7299136