Feature Selection as State-Space Search: An Empirical Study in Clustering Problems

Authors: Julian R. H. Mariño, Levi H. S. Lelis

DOI:

Keywords:

Abstract: In this paper we treat the problem of feature selection in unsupervised learning as a state-space search problem. We introduce three different heuristic functions and perform extensive experiments on datasets with tens, hundreds, and thousands of features. Namely, we test search algorithms using the heuristic functions we introduce. Our results show that this search-based approach to feature selection problems can be far superior to traditional baselines such as PCA and random projections.
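To make the state-space framing concrete, here is a minimal sketch in which a state is a set of selected feature indices, a successor adds one feature, and a heuristic function scores each state. Everything in it is an assumption for illustration: the names `gap_ratio`, `heuristic`, `successors`, and `hill_climb` are hypothetical, the gap-based score is a simple stand-in rather than one of the three heuristic functions from the paper, and greedy hill climbing is only one of many search algorithms that could be used.

```python
from typing import FrozenSet, List, Sequence

Data = Sequence[Sequence[float]]  # rows are instances, columns are features


def gap_ratio(values: Sequence[float]) -> float:
    """Largest gap between consecutive sorted values, divided by the range."""
    s = sorted(values)
    span = s[-1] - s[0]
    if span == 0:
        return 0.0
    return max(b - a for a, b in zip(s, s[1:])) / span


def heuristic(data: Data, state: FrozenSet[int]) -> float:
    """Placeholder score (an assumption): mean gap ratio of the selected features."""
    if not state:
        return 0.0
    return sum(gap_ratio([row[j] for row in data]) for j in state) / len(state)


def successors(state: FrozenSet[int], n_features: int) -> List[FrozenSet[int]]:
    """All states reachable by adding one unselected feature."""
    return [state | {j} for j in range(n_features) if j not in state]


def hill_climb(data: Data, max_features: int) -> FrozenSet[int]:
    """Greedy search: move to the best successor while the score improves."""
    n_features = len(data[0])
    state: FrozenSet[int] = frozenset()
    score = heuristic(data, state)
    while len(state) < max_features:
        candidates = successors(state, n_features)
        if not candidates:
            break
        best = max(candidates, key=lambda s: heuristic(data, s))
        best_score = heuristic(data, best)
        if best_score <= score:
            break  # local optimum under this heuristic
        state, score = best, best_score
    return state


# Toy data: feature 0 is clearly bimodal, feature 1 is evenly spread,
# feature 2 is bimodal but with a smaller relative gap than feature 0.
data = [
    [0.0, 1.0, 0.0],
    [0.1, 2.0, 0.5],
    [0.2, 3.0, 1.0],
    [5.0, 4.0, 9.0],
    [5.1, 5.0, 9.5],
    [5.2, 6.0, 10.0],
]
print(sorted(hill_climb(data, max_features=2)))  # [0]
```

On this toy data the search stops after selecting feature 0, because adding any second feature lowers the mean score. Swapping in a different heuristic or a different search algorithm only requires changing `heuristic` or `hill_climb`; the state and successor definitions stay the same.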

References (16)
Hootan Nakhost, Martin Müller. Monte-Carlo exploration for deterministic planning. International Joint Conference on Artificial Intelligence, pp. 1766-1771 (2009).
Mark Andrew Hall. Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning. International Conference on Machine Learning, pp. 359-366 (2000).
Levente Kocsis, Csaba Szepesvári. Bandit Based Monte-Carlo Planning. Lecture Notes in Computer Science, pp. 282-293 (2006). DOI: 10.1007/11871842_29
Peter J. Rousseeuw. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics, vol. 20, pp. 53-65 (1987). DOI: 10.1016/0377-0427(87)90125-7
Cecilia M. Procopiuc, Michael Jones, Pankaj K. Agarwal, T. M. Murali. A Monte Carlo algorithm for fast projective clustering. Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data (SIGMOD '02), pp. 418-427 (2002). DOI: 10.1145/564691.564739
William M. Rand. Objective Criteria for the Evaluation of Clustering Methods. Journal of the American Statistical Association, vol. 66, pp. 846-850 (1971). DOI: 10.1080/01621459.1971.10482356
Scott L. Pomeroy, Pablo Tamayo, Michelle Gaasenbeek, Lisa M. Sturla, Michael Angelo, Margaret E. McLaughlin, John Y. H. Kim, Liliana C. Goumnerova, Peter M. Black, Ching Lau, Jeffrey C. Allen, David Zagzag, James M. Olson, Tom Curran, Cynthia Wetmore, Jaclyn A. Biegel, Tomaso Poggio, Shayan Mukherjee, Ryan Rifkin, Andrea Califano, Gustavo Stolovitzky, David N. Louis, Jill P. Mesirov, Eric S. Lander, Todd R. Golub. Prediction of central nervous system embryonal tumour outcome based on gene expression. Nature, vol. 415, pp. 436-442 (2002). DOI: 10.1038/415436A
Mauricio G. C. Resende, Celso C. Ribeiro. Greedy Randomized Adaptive Search Procedures. Journal of Global Optimization, vol. 6, pp. 109-133 (1995). DOI: 10.1007/0-306-48056-5_8
Romaric Gaudel, Michele Sebag. Feature Selection as a One-Player Game. International Conference on Machine Learning, pp. 359-366 (2010).
Yuanhong Li, Ming Dong, Yunqian Ma. Feature selection for clustering with constraints using Jensen-Shannon divergence. International Conference on Pattern Recognition, pp. 1-4 (2008). DOI: 10.1109/ICPR.2008.4761805