Tight worst-case loss bounds for predicting with expert advice

作者: David Haussler , Jyrki Kivinen , Manfred K. Warmuth

DOI: 10.1007/3-540-59119-2_169

关键词: Binary numberCombinatoricsUpper and lower boundsSequenceLogarithmProbability measureSquare (algebra)Binary logarithmMathematicsFunction (mathematics)Algorithm

摘要: We consider on-line algorithms for predicting binary or continuous-valued outcomes, when the algorithm has available predictions made by N experts. For a sequence of trials, we compute total losses both and experts under loss function. At end trial sequence, compare to best expert, i.e., expert with least on particular sequence. show that large class functions, outcomes proposed Vovk exceeds at most amount c ln N, where is constant determined This upper bound does not depend any assumptions how experts'' are generated, can be arbitrarily long. give straightforward method finding correct value lower this c, asymptotically tight. The based probabilistic adversary argument. functions which holds includes square loss, logarithmic Hellinger loss. also another including absolute have an Omega((l log N)^(1/2)) bound, l number trials. Vovk''s achieves same worst-case bounds as outcomes. earlier achieved using slightly more complicated algorithm.

参考文章(17)
Nicholas Littlestone, Philip M. Long, Manfred K. Warmuth, On-line learning of linear functions symposium on the theory of computing. pp. 465- 475 ,(1991) , 10.1145/103418.103467
M.J. Weinberger, N. Merhav, M. Feder, Optimal sequential probability assignment for individual sequences international symposium on information theory. ,vol. 40, pp. 384- 396 ,(1994) , 10.1109/18.312161
Volodimir G. Vovk, Aggregating strategies conference on learning theory. pp. 371- 386 ,(1990) , 10.5555/92571.92672
Jyrki Kivinen, Manfred K. Warmuth, Using experts for predicting continuous outcomes european conference on computational learning theory. pp. 109- 120 ,(1994)
Nicolo Cesa-Bianchi, Manfred K. Warmuth, Philip M. Long, WORST-CASE QUADRATIC LOSS BOUNDS FOR ON-LINE PREDICTION OF LINEAR FUNCTIONS BY GRADIENT DESCENT University of California at Santa Cruz. ,(1993)
Nicolò Cesa-Bianchi, Yoav Freund, David P. Helmbold, Manfred K. Warmuth, On-line prediction and conversion strategies european conference on computational learning theory. ,vol. 25, pp. 205- 216 ,(1994) , 10.1023/A:1018348209754
A. DeSantis, G. Markowsky, M.N. Wegman, Learning probabilistic prediction functions [Proceedings 1988] 29th Annual Symposium on Foundations of Computer Science. pp. 110- 119 ,(1988) , 10.1109/SFCS.1988.21929
V.G. Vovk, Universal forecasting algorithms Information & Computation. ,vol. 96, pp. 245- 277 ,(1992) , 10.1016/0890-5401(92)90050-P