A Simple Extension of Stability Feature Selection

Authors: A. Beinrucker, Ü. Dogan, G. Blanchard

DOI: 10.1007/978-3-642-32717-9_26

Abstract: Stability selection [9] is a general principle for performing feature selection. It functions as a meta-layer on top of a “baseline” feature selection method, and consists in repeatedly applying the baseline to random data subsamples of half-size, finally outputting the features with selection frequency larger than a fixed threshold. In the present work, we suggest and study a simple extension of the original stability selection method: the baseline is applied to random submatrices of the data matrix X of a given size, returning those features having the largest selection frequency. We analyze from a theoretical point of view the effect of this subsampling on the selected variables, in particular the influence of the data subsample size. We report experimental results on large-dimension artificial and real data and identify in which settings stability selection is to be recommended.

References (11)
Robert E. Schapire, Yoram Singer. Improved boosting algorithms using confidence-rated predictions. Conference on Learning Theory, vol. 37, pp. 80-91 (1998). DOI: 10.1145/279943.279960
Faisal Zaman, Hideo Hirose. Effect of Subsampling Rate on Subbagging and Related Ensembles of Stable Classifiers. Lecture Notes in Computer Science, pp. 44-49 (2009). DOI: 10.1007/978-3-642-11164-8_8
Y. LeCun, L. Bottou, Y. Bengio, P. Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, vol. 86, pp. 2278-2324 (1998). DOI: 10.1109/5.726791
Curt Breneman, Mark Embrechts, Jinbo Bi, Kristin Bennett, Minghu Song. Dimensionality reduction via sparse support vector machines. Journal of Machine Learning Research, vol. 3, pp. 1229-1243 (2003).
Rajen D. Shah, Richard J. Samworth. Variable selection with error control: another look at stability selection. Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol. 75, pp. 55-80 (2013). DOI: 10.1111/J.1467-9868.2011.01034.X
François Fleuret. Fast Binary Feature Selection with Conditional Mutual Information. Journal of Machine Learning Research, vol. 5, pp. 1531-1555 (2004).
Gerard Escudero, Lluís Màrquez, German Rigau. Boosting applied to Word Sense Disambiguation. European Conference on Machine Learning, pp. 129-141 (2000). DOI: 10.1007/3-540-45164-1_14
Machine Learning: ECML 2000. Springer Berlin Heidelberg (2000). DOI: 10.1007/3-540-45164-1
Nicolai Meinshausen, Peter Bühlmann. Discussion on "Stability Selection" by Meinshausen and Buhlmann. Journal of the Royal Statistical Society: Series B (Methodological), vol. 72, pp. 413-417 (2010). DOI: 10.1111/J.1467-9868.2010.00740.X
Leo Breiman. Random Forests. Machine Learning, vol. 45, pp. 5-32 (2001). DOI: 10.1023/A:1010933404324