Tracking a small set of experts by mixing past posteriors

作者: Olivier Bousquet , Manfred K. Warmuth

DOI: 10.1007/3-540-44581-1_3

关键词: Binary logarithmPartition (database)WeightConcept driftOpen problemSequenceSmall setComputer scienceAlgorithmEncoding (memory)

摘要: In this paper, we examine on-line learning problems in which the target concept is allowed to change over time. each trial a master algorithm receives predictions from large set of n experts. Its goal predict almost as well best sequence such experts chosen off-line by partitioning training into k+1 sections and then choosing expert for section. We build on methods developed Herbster Warmuth consider an open problem posed Freund where partition are small pool size m. Since k >> m, shifts back forth between pool. propose algorithms that solve mixing past posteriors maintained algorithm. relate number bits needed encoding loss bounds algorithms. Instead paying log section first pay (n choose m) identifying m per new also twice boundaries sections.

参考文章(23)
Jyrki Kivinen, Manfred K. Warmuth, Averaging Expert Predictions european conference on computational learning theory. pp. 153- 167 ,(1999) , 10.1007/3-540-49097-3_13
Peter Auer, Claudio Gentile, Adaptive and Self-Confident On-Line Learning Algorithms conference on learning theory. pp. 107- 117 ,(2000)
Peter Auer, Manfred K. Warmuth, Tracking the Best Disjunction Machine Learning. ,vol. 32, pp. 127- 150 ,(1998) , 10.1023/A:1007472513967
Tracking the best linear predictor Journal of Machine Learning Research. ,vol. 1, pp. 281- 309 ,(2001) , 10.1162/153244301753683726
Mark Herbster, Manfred K. Warmuth, Tracking the Best Expert Machine Learning. ,vol. 32, pp. 151- 178 ,(1998) , 10.1023/A:1007424614876
Nicolò Cesa-Bianchi, Yoav Freund, David Haussler, David P. Helmbold, Robert E. Schapire, Manfred K. Warmuth, How to use expert advice Journal of the ACM. ,vol. 44, pp. 427- 485 ,(1997) , 10.1145/258128.258179
V. G. Vovk, A game of prediction with expert advice conference on learning theory. ,vol. 56, pp. 51- 60 ,(1995) , 10.1145/225298.225304
Jyrki Kivinen, Manfred K. Warmuth, Additive versus exponentiated gradient updates for linear prediction symposium on the theory of computing. pp. 209- 218 ,(1995) , 10.1145/225058.225121
V. Vovk, Derandomizing stochastic prediction strategies Proceedings of the tenth annual conference on Computational learning theory - COLT '97. ,vol. 35, pp. 32- 44 ,(1997) , 10.1145/267460.267473