Exact top-k feature selection via l 2,0 -norm constraint

作者: Heng Huang , Feiping Nie , Xiao Cai

DOI:

关键词:

摘要: In this paper, we propose a novel robust and pragmatic feature selection approach. Unlike those sparse learning based methods which tackle the approximate problem by imposing sparsity regularization in objective function, proposed method only has one l2,1-norm loss term with an explicit l2,0-Norm equality constraint. An efficient algorithm on augmented Lagrangian will be derived to solve above constrained optimization find out stable local solution. Extensive experiments four biological datasets show that although our model is not convex problem, it outperforms counterparts state-of-art evaluated terms of classification accuracy two popular classifiers. What more, since parameter meaning, i.e. number selected, avoids burden tuning parameter, making method.

参考文章(25)
Rebecca A. Betensky, Andreas Von Deimling, J. Gregory Cairncross, Catherine L. Nutt, Todd R. Golub, Todd R. Golub, Ute Pohl, Peter M. Black, David N. Louis, Scott L. Pomeroy, Tracy T. Batchelor, Christine Ladd, Margaret E. McLaughlin, Pablo Tamayo, D. R. Mani, Christian Hartmann, Gene expression-based classification of malignant gliomas correlates better with survival than histological classification. Cancer Research. ,vol. 63, pp. 1602- 1607 ,(2003)
Kenji Kira, Larry A. Rendell, A Practical Approach to Feature Selection international conference on machine learning. pp. 249- 256 ,(1992) , 10.1016/B978-1-55860-247-2.50037-1
Ron Kohavi, George H. John, Wrappers for feature subset selection Artificial Intelligence. ,vol. 97, pp. 273- 324 ,(1997) , 10.1016/S0004-3702(97)00043-X
Xiao Cai, Feiping Nie, Heng Huang, Chris Ding, Multi-Class L2,1-Norm Support Vector Machine international conference on data mining. pp. 91- 100 ,(2011) , 10.1109/ICDM.2011.105
Chris Ding, Ding Zhou, Xiaofeng He, Hongyuan Zha, R1-PCA Proceedings of the 23rd international conference on Machine learning - ICML '06. pp. 281- 288 ,(2006) , 10.1145/1143844.1143880
Luis Mancera, Javier Portilla, L0-Norm-Based Sparse Representation Through Alternate Projections international conference on image processing. pp. 2089- 2092 ,(2006) , 10.1109/ICIP.2006.312819
George Forman, Evan Kirshenbaum, Extremely fast text feature extraction for classification and indexing Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08. pp. 1221- 1230 ,(2008) , 10.1145/1458082.1458243
S. P. Fodor, DNA SEQUENCING: Massively Parallel Genomics Science. ,vol. 277, pp. 393- 395 ,(1997) , 10.1126/SCIENCE.277.5324.393
J. D. F. Habbema, J. Hermans, Selection of Variables in Discriminant Analysis by F-statistic and Error Rate Technometrics. ,vol. 19, pp. 487- 493 ,(1977) , 10.1080/00401706.1977.10489590