Regularization in skewed binary classification

作者: Sauchi Stephen Lee

DOI: 10.1007/S001800050018

关键词:

摘要: Skewed binary classification concerns the assignment of a new unknown object to one two populations, 0 or 1, on basis q-dimensional vector x = (x1, …xq), where for example population 0, is prevalent class. Assignment rules are developed from learning samples known objects, that is, objects come each populations. Since 1 rare class, overfitting and generalization problems arise easily many models. We propose an effective solution by assigning more weights class 1. The idea produce noisy replicates cases while keeping dominant unchanged. models considered are: nearest neighbor method, neural networks, trees, quadratic discriminant. Noisy replication was applied three real world simulated data sets. Encouraging results were obtained all considered.

参考文章(10)
Richard A Olshen, Charles J Stone, Leo Breiman, Jerome H Friedman, Classification and regression trees ,(1983)
YUVAL RAVIV, NATHAN INTRATOR, Bootstrapping with Noise: An Effective Regularization Technique Connection Science. ,vol. 8, pp. 355- 372 ,(1996) , 10.1080/095400996116811
A. Mkhadri, G. Celeux, A. Nasroallah, Regularization in discriminant analysis: an overview Computational Statistics & Data Analysis. ,vol. 23, pp. 403- 423 ,(1997) , 10.1016/S0167-9473(96)00043-6
Jocelyn Sietsma, Robert J.F. Dow, Creating artificial neural networks that generalize Neural Networks. ,vol. 4, pp. 67- 79 ,(1991) , 10.1016/0893-6080(91)90033-2
Chris M. Bishop, Training with noise is equivalent to Tikhonov regularization Neural Computation. ,vol. 7, pp. 108- 116 ,(1995) , 10.1162/NECO.1995.7.1.108
N. L. Hjort, Brian D. Ripley, Pattern recognition and neural networks ,(1996)
Anders Krogh, Richard G. Palmer, John Hertz, Introduction To The Theory Of Neural Computation ,(1991)
W. N. Venables, B. D. Ripley, Modern Applied Statistics with S-Plus. Biometrics. ,vol. 52, pp. 1528- ,(1996) , 10.2307/2532871
Leo Breiman, Bagging predictors Machine Learning archive. ,vol. 24, pp. 123- ,(1996) , 10.1023/A:1018054314350