Additive models, boosting, and inference for generalized divergences

作者: John Lafferty

DOI: 10.1145/307400.307422

关键词: Pattern recognitionExponential functionApplied mathematicsEntropy (information theory)InferenceAdaBoostQuadratic equationArtificial intelligenceAdditive modelBregman divergenceLegendre transformationMathematics

摘要: We present a framework for designing incremental learning algorithms derived from generalized entropy functionals. Our approach is based on the use of Bregman divergences together with associated class additive models constructed using Legendre transform. A particular one-parameter family shown to yield loss functions that includes log-likelihood criterion logistic regression as special case, and closely approximates exponential used in AdaBoost Schapire et a/., natural parameter varies. also show how quadratic approximation gain divergence results weighted least-squares criterion. This leads builds upon extends recent interpretation boosting terms proposed by Friedman, Hastie, Tibshirani.

参考文章(22)
I. Csiszár, Maxent, Mathematics, and Information Theory Maximum Entropy and Bayesian Methods. pp. 35- 50 ,(1996) , 10.1007/978-94-011-5430-7_5
John Lafferty, Jaime G. Carbonell, Ralf D Brown, Yiming Yang, Xin Liu, Tom Pierce, CMU Report on TDT-2: Segmentation, Detection and Tracking ,(1999)
Doug Beeferman, Adam Berger, John Lafferty, Statistical Models for Text Segmentation Machine Learning. ,vol. 34, pp. 177- 210 ,(1999) , 10.1023/A:1007506220214
Richard A Olshen, Charles J Stone, Leo Breiman, Jerome H Friedman, Classification and regression trees ,(1983)
Jyrki Kivinen, Manfred K. Warmuth, Boosting as entropy projection conference on learning theory. pp. 134- 144 ,(1999) , 10.1145/307400.307424
Y. Censor, A. Lent, An iterative row-action method for interval convex programming Journal of Optimization Theory and Applications. ,vol. 34, pp. 321- 353 ,(1981) , 10.1007/BF00934676
Jyrki Kivinen, Manfred K. Warmuth, Additive versus exponentiated gradient updates for linear prediction symposium on the theory of computing. pp. 209- 218 ,(1995) , 10.1145/225058.225121
Jerome Friedman, Trevor Hastie, Robert Tibshirani, Additive logistic regression: a statistical view of boosting (With discussion and a rejoinder by the authors) Annals of Statistics. ,vol. 28, pp. 337- 407 ,(2000) , 10.1214/AOS/1016218223
Robert E. Schapire, Yoram Singer, Improved boosting algorithms using confidence-rated predictions conference on learning theory. ,vol. 37, pp. 80- 91 ,(1998) , 10.1145/279943.279960