Boosting SVM Classifiers with Logistic Regression

作者: Yuan-chin Ivan Chang

DOI:

关键词:

摘要: The support vector machine classifier is a linear maximum margin classifier. It performs very well in many classification applications. Although, it could be extended to nonlinear cases by exploiting the idea of kernel, might still suffer from heterogeneity training examples. Since there are few theories literature guide us on how choose kernel functions, selection usually based try-and-error manner. When set imbalanced, data not separable feature space defined chosen kernel. In this paper, we propose hybrid method integrating “small” classifiers logistic regression models. By appropriately partitioning set, ensemble can improve performance SVM trained with whole examples at time. With method, only avoid difficulty heterogeneity, but also have probability outputs for all Moreover, less ambiguous than combined voting schemes. From our simulation studies and some empirical results, find that these kinds robust following sense: (1) improves (prediction accuracy) when kind examples; (2) least as good original classifier, actually no presented We apply multi-class problems replacing binary models polychotomous model constructed individual

参考文章(15)
Nathalie Japkowicz, Learning from Imbalanced Data Sets: A Comparison of Various Strategies * International Workshop on Learning from Imbalanced Data Sets. ,(2000)
Peter McCullagh, John Ashworth Nelder, Generalized Linear Models ,(1983)
Foster Provost, Tom Fawcett, Robust classification systems for imprecise environments national conference on artificial intelligence. pp. 706- 713 ,(1998)
Miroslav Kubat, Robert Holte, Stan Matwin, Learning When Negative Examples Abound european conference on machine learning. pp. 146- 153 ,(1997) , 10.1007/3-540-62858-4_79
Usama M. Fayyad, Paul S. Bradley, Refining Initial Points for K-Means Clustering international conference on machine learning. pp. 91- 99 ,(1998)
Georg Heinze, Michael Schemper, A solution to the problem of separation in logistic regression Statistics in Medicine. ,vol. 21, pp. 2409- 2419 ,(2002) , 10.1002/SIM.1047
Trevor Hastie, Robert Tibshirani, Classification by pairwise coupling Annals of Statistics. ,vol. 26, pp. 451- 471 ,(1998) , 10.1214/AOS/1028144844
KAGAN TUMER, JOYDEEP GHOSH, Error Correlation and Error Reduction in Ensemble Classifiers Connection Science. ,vol. 8, pp. 385- 404 ,(1996) , 10.1080/095400996116839
DAVID FIRTH, Bias reduction of maximum likelihood estimates Biometrika. ,vol. 80, pp. 27- 38 ,(1993) , 10.1093/BIOMET/80.1.27
C. L. Blake, UCI Repository of machine learning databases www.ics.uci.edu/〜mlearn/MLRepository.html. ,(1998)