Tracking a small set of experts by mixing past posteriors

作者： Olivier Bousquet , Manfred K. Warmuth

关键词: Binary logarithm 、 Partition (database) 、 Weight 、 Concept drift 、 Open problem 、 Sequence 、 Small set 、 Computer science 、 Algorithm 、 Encoding (memory)

摘要: In this paper, we examine on-line learning problems in which the target concept is allowed to change over time. each trial a master algorithm receives predictions from large set of n experts. Its goal predict almost as well best sequence such experts chosen off-line by partitioning training into k+1 sections and then choosing expert for section. We build on methods developed Herbster Warmuth consider an open problem posed Freund where partition are small pool size m. Since k >> m, shifts back forth between pool. propose algorithms that solve mixing past posteriors maintained algorithm. relate number bits needed encoding loss bounds algorithms. Instead paying log section first pay (n choose m) identifying m per new also twice boundaries sections.

参考文章(23)

Jyrki Kivinen, Manfred K. Warmuth, Averaging Expert Predictions european conference on computational learning theory. pp. 153- 167 ,(1999) , 10.1007/3-540-49097-3_13

Peter Auer, Claudio Gentile, Adaptive and Self-Confident On-Line Learning Algorithms conference on learning theory. pp. 107- 117 ,(2000)

Peter Auer, Manfred K. Warmuth, Tracking the Best Disjunction Machine Learning. ,vol. 32, pp. 127- 150 ,(1998) , 10.1023/A:1007472513967

Tracking the best linear predictor Journal of Machine Learning Research. ,vol. 1, pp. 281- 309 ,(2001) , 10.1162/153244301753683726

Mark Herbster, Manfred K. Warmuth, Tracking the Best Expert Machine Learning. ,vol. 32, pp. 151- 178 ,(1998) , 10.1023/A:1007424614876

Nicolò Cesa-Bianchi, Yoav Freund, David Haussler, David P. Helmbold, Robert E. Schapire, Manfred K. Warmuth, How to use expert advice Journal of the ACM. ,vol. 44, pp. 427- 485 ,(1997) , 10.1145/258128.258179

V. G. Vovk, A game of prediction with expert advice conference on learning theory. ,vol. 56, pp. 51- 60 ,(1995) , 10.1145/225298.225304

Michael R. Garey, David S. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness ,(1979)

Jyrki Kivinen, Manfred K. Warmuth, Additive versus exponentiated gradient updates for linear prediction symposium on the theory of computing. pp. 209- 218 ,(1995) , 10.1145/225058.225121

10.

V. Vovk, Derandomizing stochastic prediction strategies Proceedings of the tenth annual conference on Computational learning theory - COLT '97. ,vol. 35, pp. 32- 44 ,(1997) , 10.1145/267460.267473

Tracking a small set of experts by mixing past posteriors

来源期刊

我的账户

Tracking a small set of experts by mixing past posteriors

来源期刊

相似文章 10

我的账户