Classification by ensembles from random partitions of high-dimensional data

作者: Hongshik Ahn , Hojin Moon , Melissa J. Fazzari , Noha Lim , James J. Chen

DOI: 10.1016/J.CSDA.2006.12.043

关键词:

摘要: A robust classification procedure is developed based on ensembles of classifiers, with each classifier constructed from a different set predictors determined by random partition the entire predictors. The proposed methods combine results multiple classifiers to achieve substantially improved prediction compared optimal single classifier. This approach designed specifically for high-dimensional data sets which sought. By combining built subspace predictors, computational advantage in tackling growing problem dimensionality. For we build tree or logistic regression tree. Our study shows, using four real areas, that our perform consistently well widely used methods. unbalanced data, maintains balance between sensitivity and specificity more adequately than many other considered this study.

参考文章(44)
Sebastian Lewandowski, Katarzyna Kalita, Leszek Kaczmarek, Estrogen receptor β: Potential functional significance of a variety of mRNA isoforms FEBS Letters. ,vol. 524, pp. 1- 5 ,(2002) , 10.1016/S0014-5793(02)03015-6
Wei-Yin Loh, Yu-Shan Shih, SPLIT SELECTION METHODS FOR CLASSIFICATION TREES ,(1997)
Sandrine Dudoit, Jane Fridlyand, Classification in microarray experiments Chapman and Hall/CRC. ,(2003) , 10.1201/9780203011232.CH3
James Franklin, The elements of statistical learning : data mining, inference,and prediction The Mathematical Intelligencer. ,vol. 27, pp. 83- 85 ,(2005) , 10.1007/BF02985802
Richard A Olshen, Charles J Stone, Leo Breiman, Jerome H Friedman, Classification and regression trees ,(1983)
William S. Branham, Stacey L. Dial, Carrie L. Moland, Bruce S. Hass, Robert M. Blair, Hong Fang, Leming Shi, Weida Tong, Roger G. Perkins, Daniel M. Sheehan, Phytoestrogens and mycoestrogens bind to the rat uterine estrogen receptor. Journal of Nutrition. ,vol. 132, pp. 658- 664 ,(2002) , 10.1093/JN/132.4.658
Sandrine Dudoit, Jane Fridlyand, Terence P Speed, None, Comparison of discrimination methods for the classification of tumors using gene expression data Journal of the American Statistical Association. ,vol. 97, pp. 77- 87 ,(2002) , 10.1198/016214502753479248
Alan J. Miller, Subset Selection in Regression ,(2002)
J. J. Chen, C. A. Tsai, J. F. Young, R. L. Kodell, Classification ensembles for unbalanced class sizes in predictive toxicology Sar and Qsar in Environmental Research. ,vol. 16, pp. 517- 529 ,(2005) , 10.1080/10659360500468468
Yoav Freund, Robert E Schapire, A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting conference on learning theory. ,vol. 55, pp. 119- 139 ,(1997) , 10.1006/JCSS.1997.1504