Packet Routing in Dynamically Changing Networks: A Reinforcement Learning Approach

Authors: Justin A. Boyan, Michael L. Littman

Abstract: … " algorithm, related to certain distributed packet routing algorithms [6, 7], learns a routing policy … It does this by experimenting with different routing policies and gathering statistics about …

References (8)
S. B. Thrun, The role of exploration in learning control. Handbook of Intelligent Control: Neural, Fuzzy and Adaptive Approaches (1992).
Michael Littman, Justin Boyan, A Distributed Reinforcement Learning Scheme for Network Routing. Carnegie Mellon University, pp. 55-61 (1993). DOI: 10.4324/9780203773826-9
Long-Ji Lin, Reinforcement learning for robots using neural networks. Carnegie Mellon University (1992).
Lester Randolph Ford, Flows in networks (1962).
Gerald Tesauro, Practical Issues in Temporal Difference Learning. Machine Learning, vol. 8, pp. 257-277 (1992). DOI: 10.1007/BF00992697
Richard Bellman, On a Routing Problem. Quarterly of Applied Mathematics, vol. 16, pp. 87-90 (1958). DOI: 10.1090/QAM/102435
C. J. C. H. Watkins, Learning from delayed rewards. Ph.D. thesis, Cambridge University Psychology Department (1989).