Multi-Agent Reinforcement Learning: A Survey

作者: Lucian Busoniu , Robert Babuska , Bart De Schutter

DOI: 10.1109/ICARCV.2006.345353

关键词:

摘要: Multi-agent systems are rapidly finding applications in a variety of domains, including robotics, distributed control, telecommunications, economics. Many tasks arising these domains require that the agents learn behaviors online. A significant part research on multi-agent learning concerns reinforcement techniques. However, due to different viewpoints central issues, such as formal statement goal, large number methods and approaches have been introduced. In this paper we aim present an integrated survey field. First, issue goal is discussed, after which representative selection algorithms reviewed. Finally, open issues identified future directions outlined

参考文章(35)
Manuela Veloso, Michael Bowling, Multiagent learning in the presence of agents with limitations Carnegie Mellon University. ,(2003)
Craig Boutilier, Planning, Learning and Coordination in Multiagent Decision Processes theoretical aspects of rationality and knowledge. pp. 195- 210 ,(1996)
José M. Vidal, Learning in multiagent systems: an introduction from a game-theoretic perspective Lecture Notes in Computer Science. ,vol. 2636, pp. 202- 215 ,(2003) , 10.1007/3-540-44826-8_13
Christopher J.C.H. Watkins, Peter Dayan, Technical Note Q-Learning Machine Learning. ,vol. 8, pp. 279- 292 ,(1992) , 10.1023/A:1022676722315
Martin A. Riedmiller, Martin Lauer, An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems international conference on machine learning. pp. 535- 542 ,(2000)
M Kearns, Y Mansour, S Singh, Nash convergence of gradient dynamics in general-sum games uncertainty in artificial intelligence. pp. 541- 548 ,(2000)
David Carmel, Shaul Markovitch, Opponent Modeling in Multi-Agent Systems international joint conference on artificial intelligence. pp. 40- 52 ,(1995) , 10.1007/3-540-60923-7_18
Nobuo Suematsu, Akira Hayashi, A multiagent reinforcement learning algorithm using extended optimal response Proceedings of the first international joint conference on Autonomous agents and multiagent systems part 1 - AAMAS '02. pp. 370- 377 ,(2002) , 10.1145/544741.544831
Michael L. Littman, Value-function reinforcement learning in Markov games Cognitive Systems Research. ,vol. 2, pp. 55- 66 ,(2001) , 10.1016/S1389-0417(01)00015-8
Karl Tuyls, Ann Nowé, Evolutionary game theory and multi-agent reinforcement learning Knowledge Engineering Review. ,vol. 20, pp. 63- 90 ,(2005) , 10.1017/S026988890500041X