Lower Bounds for Howard’s Algorithm for Finding Minimum Mean-Cost Cycles

关键词:

摘要: Howard’s policy iteration algorithm is one of the most widely used algorithms for finding optimal policies controlling Markov Decision Processes (MDPs). When applied to weighted directed graphs, which may be viewed as Deterministic MDPs (DMDPs), can find Minimum Mean-Cost cycles (MMCC). Experimental studies suggest that works extremely well in this context. The theoretical complexity MMCCs a mystery. No polynomial time bound known on its running time. Prior work, there were only linear lower bounds number iterations performed by algorithm. We provide first graphs performs Ω(n 2) iterations, where n vertices graph.

springer.com 本地加速

参考文章(17)

Robert E. Tarjan, Andrew V. Goldberg, Loukas Georgiadis, Renato F. Werneck, An experimental study of minimum mean cycle algorithms algorithm engineering and experimentation. pp. 1- 13 ,(2009)

John Fearnley, Exponential Lower Bounds for Policy Iteration Automata, Languages and Programming. pp. 551- 562 ,(2010) , 10.1007/978-3-642-14162-1_46

Ali Dasdan, Experimental analysis of the fastest optimum cycle ratio and mean algorithms ACM Transactions on Design Automation of Electronic Systems. ,vol. 9, pp. 385- 418 ,(2004) , 10.1145/1027084.1027085

L. R. Ford, D. R. Fulkerson, Maximal Flow Through a Network Canadian Journal of Mathematics. ,vol. 8, pp. 243- 248 ,(1956) , 10.1007/978-0-8176-4842-8_15

Nimrod Megiddo, Combinatorial Optimization with Rational Objective Functions Mathematics of Operations Research. ,vol. 4, pp. 414- 424 ,(1979) , 10.1287/MOOR.4.4.414

Ronald A. Howard, Dynamic Programming and Markov Processes ,(1960)

Richard M. Karp, A characterization of the minimum cycle mean in a digraph Discrete Mathematics. ,vol. 23, pp. 309- 311 ,(1978) , 10.1016/0012-365X(78)90011-0

Andrew V. Goldberg, Robert E. Tarjan, Finding minimum-cost circulations by canceling negative cycles Journal of the ACM. ,vol. 36, pp. 873- 886 ,(1989) , 10.1145/76359.76368

Nimrod Megiddo, Applying Parallel Computation Algorithms in the Design of Serial Algorithms Journal of the ACM. ,vol. 30, pp. 852- 865 ,(1983) , 10.1145/2157.322410

10.

James B. Orlin, Robert Endre Tarjan, Neal E. Young, Faster Parametric Shortest Path and Minimum Balance Algorithms ,(2011)

Lower Bounds for Howard’s Algorithm for Finding Minimum Mean-Cost Cycles

来源期刊

我的账户

Lower Bounds for Howard’s Algorithm for Finding Minimum Mean-Cost Cycles

来源期刊

相似文章 10

我的账户