作者: A. Hapfelmeier , K. Ulm
DOI: 10.1016/J.CSDA.2012.09.020
关键词: Control (linguistics) 、 Random forest 、 Mathematics 、 Feature selection 、 Word error rate 、 Machine learning 、 Artificial intelligence 、 Regression 、 Multiple comparisons problem 、 Permutation
摘要: Random Forests are frequently applied as they achieve a high prediction accuracy and have the ability to identify informative variables. Several approaches for variable selection been proposed combine intensify these qualities. An extensive review of corresponding literature led development new approach that is based on theoretical framework permutation tests meets important statistical properties. A comparison another eight popular methods in three simulation studies four real data applications indicated that: can also be used control test-wise family-wise error rate, provides higher power distinguish relevant from irrelevant variables leads models which located among very best performing ones. In addition, it equally applicable regression classification problems.