作者: Satinder P. Singh , David Cohn
DOI:
关键词: Dynamic programming 、 Computer science 、 Mathematical optimization 、 Markov decision process 、 Partially observable Markov decision process 、 Merge (version control)
摘要: We are frequently called upon to perform multiple tasks that compete for our attention and resource. Often we know the optimal solution each task in isolation; this paper, describe how knowledge can be exploited efficiently find good solutions doing parallel. formulate problem as of dynamically merging Markov decision processes (MDPs) into a composite MDP, present new theoretically-sound dynamic programming algorithm finding an policy MDP. analyze various aspects illustrate its use on simple problem.