Online Reinforcement Learning for Dynamic Multimedia Systems

作者: N. Mastronarde , M. van der Schaar

DOI: 10.1109/TIP.2009.2035228

关键词: Artificial intelligenceMultimediaStability (learning theory)Robot learningInstance-based learningProbabilistic logicCompetitive learningMarkov processComputational learning theoryMarkov decision processOnline machine learningActive learning (machine learning)Reinforcement learningComputer science

摘要: … In Section VI-D, we show that the learning performance (including the weighted estimation error and the average reward) can be dramatically improved by smartly updating multiple state…

参考文章(33)
Christopher J. C. H. Watkins, Peter Dayan, Technical Note : \cal Q -Learning Machine Learning. ,vol. 8, pp. 279- 292 ,(1992) , 10.1007/BF00992698
Radu Cornea, Nalini Venkatasubramanian, Alex Nicolau, Shivajit Mohapatra, Nikil Dutt, Managing Cross-Layer Constraints for Interactive Mobile Multimedia∗ ,(2003)
D.G. Sachs, S.V. Adve, D.L. Jones, Cross-layer adaptive video coding to reduce energy on general-purpose processors international conference on image processing. ,vol. 3, pp. 109- 112 ,(2003) , 10.1109/ICIP.2003.1247193
A. Schaerf, Y. Shoham, M. Tennenholtz, Adaptive load balancing: a study in multi-agent learning Journal of Artificial Intelligence Research. ,vol. 2, pp. 475- 500 ,(1994) , 10.1613/JAIR.121
Damien Ernst, Arthur Louette, Introduction to Reinforcement Learning MIT Press. ,(1998)
Christopher J.C.H. Watkins, Peter Dayan, Technical Note Q-Learning Machine Learning. ,vol. 8, pp. 279- 292 ,(1992) , 10.1023/A:1022676722315
D. S. Turaga, Hierarchical Modeling of Variable Bit Rate Video Sources PV 2001. pp. 22- 31 ,(2001)
O.F. Rana, J.O. Kephart, Building Effective Multivendor Autonomic Computing Systems IEEE Distributed Systems Online. ,vol. 7, pp. 3- 3 ,(2006) , 10.1109/MDSO.2006.53
Shivajit Mohapatra, Radu Cornea, Nikil Dutt, Alex Nicolau, Nalini Venkatasubramanian, Integrated power management for video streaming to mobile handheld devices acm multimedia. pp. 582- 591 ,(2003) , 10.1145/957013.957134
E. Akyol, M. van der Schaar, Complexity Model Based Proactive Dynamic Voltage Scaling for Video Decoding Systems IEEE Transactions on Multimedia. ,vol. 9, pp. 1475- 1492 ,(2007) , 10.1109/TMM.2007.906563