Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion

作者: A. Federgruen , A. Hordijk , H.C. Tijms

DOI: 10.1016/0304-4149(79)90034-6

关键词:

摘要: Abstract This paper establishes a rather complete optimality theory for the average cost semi-Markov decision model with denumerable state space, compact metric action sets and unbounded one-step costs case where underlying Markov chains have single ergotic set. Under condition which, roughly speaking, requires existence of finite set such that supremum over all stationary policies expected time total absolute incurred until first return to this are any starting state, we shall verify solution equation an optimal policy.

参考文章(17)
A. Federgruen, A. Hordijk, H.C. Tijms, RECURRENCE CONDITIONS IN DENUMERABLE STATE MARKOV DECISION PROCESSES Dynamic Programming and its Applications#R##N#Proceedings of the International Conference on Dynamic Programming and its Applications, University of British Columbia, Vancouver, British Columbia, Canada, April 14–16, 1977. pp. 3- 22 ,(1978) , 10.1016/B978-0-12-568150-6.50007-6
Arie Hordijk, Regenerative Markov decision models Mathematical Programming Studies. pp. 49- 72 ,(1976) , 10.1007/BFB0120744
Cyrus Derman, Ralph E. Strauch, A Note on Memoryless Rules for Controlling Sequential Control Processes Annals of Mathematical Statistics. ,vol. 37, pp. 276- 278 ,(1966) , 10.1214/AOMS/1177699618
Jacob Wijngaard, Stationary Markovian Decision Problems and Perturbation Theory of Quasi-Compact Linear Operators Mathematics of Operations Research. ,vol. 2, pp. 91- 102 ,(1977) , 10.1287/MOOR.2.1.91
Lloyd Fisher, Sheldon M. Ross, An Example in Denumerable Decision Processes Annals of Mathematical Statistics. ,vol. 39, pp. 674- 675 ,(1968) , 10.1214/AOMS/1177698426
Steven A. Lippman, On Dynamic Programming with Unbounded Rewards Management Science. ,vol. 21, pp. 1225- 1233 ,(1975) , 10.1287/MNSC.21.11.1225
A Hordijk, van Km Kees Hee, van der J Jan Wal, Successive approximations for convergent dynamic programming Stichting Mathematisch Centrum. pp. 183- 211 ,(1977)