DOI: 10.1016/B978-1-55860-200-7.50076-3
Keywords: Aliasing (computing), Task (project management), Reinforcement learning, Variable (computer science), Computer science, Robot, Variation (game tree), Expected utility hypothesis, Artificial intelligence, Modularity (networks), State vector
Abstract: … of Q-learning that allows the modular architecture to reduce the effects of perceptual aliasing on reward estimation. Q-learning … actions that achieve a state in GNN · is the likelihood ratio …