Online Reinforcement Learning for Dynamic Multimedia Systems

DOI: 10.1109/TIP.2009.2035228

关键词: Artificial intelligence 、 Multimedia 、 Stability (learning theory) 、 Robot learning 、 Instance-based learning 、 Probabilistic logic 、 Competitive learning 、 Markov process 、 Computational learning theory 、 Markov decision process 、 Online machine learning 、 Active learning (machine learning) 、 Reinforcement learning 、 Computer science

摘要: … In Section VI-D, we show that the learning performance (including the weighted estimation error and the average reward) can be dramatically improved by smartly updating multiple state…

ieee.org LINK 下载加速

doi.org PDF 下载加速

uni-trier.de PDF 下载加速

sci-hub.se PDF 下载加速

arxiv.org PDF 下载加速

参考文章(33)

Christopher J. C. H. Watkins, Peter Dayan, Technical Note : \cal Q -Learning Machine Learning. ,vol. 8, pp. 279- 292 ,(1992) , 10.1007/BF00992698

Radu Cornea, Nalini Venkatasubramanian, Alex Nicolau, Shivajit Mohapatra, Nikil Dutt, Managing Cross-Layer Constraints for Interactive Mobile Multimedia∗ ,(2003)

D.G. Sachs, S.V. Adve, D.L. Jones, Cross-layer adaptive video coding to reduce energy on general-purpose processors international conference on image processing. ,vol. 3, pp. 109- 112 ,(2003) , 10.1109/ICIP.2003.1247193

A. Schaerf, Y. Shoham, M. Tennenholtz, Adaptive load balancing: a study in multi-agent learning Journal of Artificial Intelligence Research. ,vol. 2, pp. 475- 500 ,(1994) , 10.1613/JAIR.121

Damien Ernst, Arthur Louette, Introduction to Reinforcement Learning MIT Press. ,(1998)

Christopher J.C.H. Watkins, Peter Dayan, Technical Note Q-Learning Machine Learning. ,vol. 8, pp. 279- 292 ,(1992) , 10.1023/A:1022676722315

D. S. Turaga, Hierarchical Modeling of Variable Bit Rate Video Sources PV 2001. pp. 22- 31 ,(2001)

O.F. Rana, J.O. Kephart, Building Effective Multivendor Autonomic Computing Systems IEEE Distributed Systems Online. ,vol. 7, pp. 3- 3 ,(2006) , 10.1109/MDSO.2006.53

Shivajit Mohapatra, Radu Cornea, Nikil Dutt, Alex Nicolau, Nalini Venkatasubramanian, Integrated power management for video streaming to mobile handheld devices acm multimedia. pp. 582- 591 ,(2003) , 10.1145/957013.957134

10.

E. Akyol, M. van der Schaar, Complexity Model Based Proactive Dynamic Voltage Scaling for Video Decoding Systems IEEE Transactions on Multimedia. ,vol. 9, pp. 1475- 1492 ,(2007) , 10.1109/TMM.2007.906563

Online Reinforcement Learning for Dynamic Multimedia Systems

来源期刊

我的账户

Online Reinforcement Learning for Dynamic Multimedia Systems

来源期刊

相似文章 10

我的账户