Adaptive game playing using multiplicative weights

作者: Yoav Freund , Robert E. Schapire

DOI: 10.1006/GAME.1999.0738

关键词: StrategyMathematicsGame treeExample of a game without a valueMathematical economicsRepeated gameAlgorithmAlgorithmic game theorySymmetric gameNormal-form gameSequential game

摘要: Abstract We present a simple algorithm for playing repeated game. show that player using this suffers average loss is guaranteed to come close the minimum achievable by any fixed strategy. Our bounds are nonasymptotic and hold opponent. The algorithm, which uses multiplicative-weight methods of Littlestone Warmuth, analyzed Kullback–Liebler divergence. This analysis yields new, proof min–max theorem, as well provable method approximately solving A variant our game-playing proved be optimal in very strong sense. Journal Economic Literature Classification Numbers: C44, C70, D83.

参考文章(33)
Philip Klein, Neal Young, On the Number of Iterations for Dantzig-Wolfe Optimization and Packing-Covering Approximation Algorithms integer programming and combinatorial optimization. pp. 320- 327 ,(1999) , 10.1007/3-540-48777-8_24
David P. Helmbold, Robert E. Schapire, Yoram Singer, Manfred K. Warmuth, ON-LINE PORTFOLIO SELECTION USING MULTIPLICATIVE UPDATES Mathematical Finance. ,vol. 8, pp. 325- 347 ,(1998) , 10.1111/1467-9965.00058
Nicolò Cesa-Bianchi, Yoav Freund, David Haussler, David P. Helmbold, Robert E. Schapire, Manfred K. Warmuth, How to use expert advice Journal of the ACM. ,vol. 44, pp. 427- 485 ,(1997) , 10.1145/258128.258179
David Blackwell, An analog of the minimax theorem for vector payoffs. Pacific Journal of Mathematics. ,vol. 6, pp. 1- 8 ,(1956) , 10.2140/PJM.1956.6.1
Neal E. Young, Randomized rounding without solving the linear program symposium on discrete algorithms. pp. 170- 178 ,(1995) , 10.5555/313651.313689
V. G. Vovk, A game of prediction with expert advice conference on learning theory. ,vol. 56, pp. 51- 60 ,(1995) , 10.1145/225298.225304
Dean P. Foster, Prediction in the Worst Case Annals of Statistics. ,vol. 19, pp. 1084- 1090 ,(1991) , 10.1214/AOS/1176348140
Jyrki Kivinen, Manfred K. Warmuth, Additive versus exponentiated gradient updates for linear prediction symposium on the theory of computing. pp. 209- 218 ,(1995) , 10.1145/225058.225121
Henri Frédéric Bohnenblust, Kenneth Joseph Arrow, Contributions to the theory of games ,(1950)
Nicolò Cesa-Bianchi, Yoav Freund, David P. Helmbold, David Haussler, Robert E. Schapire, Manfred K. Warmuth, How to use expert advice Proceedings of the twenty-fifth annual ACM symposium on Theory of computing - STOC '93. pp. 382- 391 ,(1993) , 10.1145/167088.167198