Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion

作者： A. Federgruen , A. Hordijk , H.C. Tijms

关键词:

摘要: Abstract This paper establishes a rather complete optimality theory for the average cost semi-Markov decision model with denumerable state space, compact metric action sets and unbounded one-step costs case where underlying Markov chains have single ergotic set. Under condition which, roughly speaking, requires existence of finite set such that supremum over all stationary policies expected time total absolute incurred until first return to this are any starting state, we shall verify solution equation an optimal policy.

参考文章(17)

A. Federgruen, A. Hordijk, H.C. Tijms, RECURRENCE CONDITIONS IN DENUMERABLE STATE MARKOV DECISION PROCESSES Dynamic Programming and its Applications#R##N#Proceedings of the International Conference on Dynamic Programming and its Applications, University of British Columbia, Vancouver, British Columbia, Canada, April 14–16, 1977. pp. 3- 22 ,(1978) , 10.1016/B978-0-12-568150-6.50007-6

Arie Hordijk, Dynamic programming and Markov potential theory ,(1974)

Arie Hordijk, Regenerative Markov decision models Mathematical Programming Studies. pp. 49- 72 ,(1976) , 10.1007/BFB0120744

Cyrus Derman, Ralph E. Strauch, A Note on Memoryless Rules for Controlling Sequential Control Processes Annals of Mathematical Statistics. ,vol. 37, pp. 276- 278 ,(1966) , 10.1214/AOMS/1177699618

Jacob Wijngaard, Stationary Markovian Decision Problems and Perturbation Theory of Quasi-Compact Linear Operators Mathematics of Operations Research. ,vol. 2, pp. 91- 102 ,(1977) , 10.1287/MOOR.2.1.91

Sheldon M. Ross, Applied Probability Models with Optimization Applications ,(1970)

Howard M. Taylor, Markovian sequential replacement processes ,(1965)

Lloyd Fisher, Sheldon M. Ross, An Example in Denumerable Decision Processes Annals of Mathematical Statistics. ,vol. 39, pp. 674- 675 ,(1968) , 10.1214/AOMS/1177698426

Steven A. Lippman, On Dynamic Programming with Unbounded Rewards Management Science. ,vol. 21, pp. 1225- 1233 ,(1975) , 10.1287/MNSC.21.11.1225

10.

A Hordijk, van Km Kees Hee, van der J Jan Wal, Successive approximations for convergent dynamic programming Stichting Mathematisch Centrum. pp. 183- 211 ,(1977)

Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion

来源期刊

我的账户

Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion

来源期刊

相似文章 10

我的账户