Lower Bounds for Howard’s Algorithm for Finding Minimum Mean-Cost Cycles

作者: Thomas Dueholm Hansen , Uri Zwick

DOI: 10.1007/978-3-642-17517-6_37

关键词:

摘要: Howard’s policy iteration algorithm is one of the most widely used algorithms for finding optimal policies controlling Markov Decision Processes (MDPs). When applied to weighted directed graphs, which may be viewed as Deterministic MDPs (DMDPs), can find Minimum Mean-Cost cycles (MMCC). Experimental studies suggest that works extremely well in this context. The theoretical complexity MMCCs a mystery. No polynomial time bound known on its running time. Prior work, there were only linear lower bounds number iterations performed by algorithm. We provide first graphs performs Ω(n 2) iterations, where n vertices graph.

参考文章(17)
Robert E. Tarjan, Andrew V. Goldberg, Loukas Georgiadis, Renato F. Werneck, An experimental study of minimum mean cycle algorithms algorithm engineering and experimentation. pp. 1- 13 ,(2009)
John Fearnley, Exponential Lower Bounds for Policy Iteration Automata, Languages and Programming. pp. 551- 562 ,(2010) , 10.1007/978-3-642-14162-1_46
Ali Dasdan, Experimental analysis of the fastest optimum cycle ratio and mean algorithms ACM Transactions on Design Automation of Electronic Systems. ,vol. 9, pp. 385- 418 ,(2004) , 10.1145/1027084.1027085
L. R. Ford, D. R. Fulkerson, Maximal Flow Through a Network Canadian Journal of Mathematics. ,vol. 8, pp. 243- 248 ,(1956) , 10.1007/978-0-8176-4842-8_15
Nimrod Megiddo, Combinatorial Optimization with Rational Objective Functions Mathematics of Operations Research. ,vol. 4, pp. 414- 424 ,(1979) , 10.1287/MOOR.4.4.414
Richard M. Karp, A characterization of the minimum cycle mean in a digraph Discrete Mathematics. ,vol. 23, pp. 309- 311 ,(1978) , 10.1016/0012-365X(78)90011-0
Andrew V. Goldberg, Robert E. Tarjan, Finding minimum-cost circulations by canceling negative cycles Journal of the ACM. ,vol. 36, pp. 873- 886 ,(1989) , 10.1145/76359.76368
Nimrod Megiddo, Applying Parallel Computation Algorithms in the Design of Serial Algorithms Journal of the ACM. ,vol. 30, pp. 852- 865 ,(1983) , 10.1145/2157.322410
James B. Orlin, Robert Endre Tarjan, Neal E. Young, Faster Parametric Shortest Path and Minimum Balance Algorithms ,(2011)