作者: Minwoo Lee , Charles W Anderson , None
关键词: Computer science 、 Sampling (statistics) 、 Relevance (information retrieval) 、 Reinforcement learning 、 Surface (mathematics) 、 Function approximation 、 Action (philosophy) 、 Kernel (linear algebra) 、 Space (mathematics) 、 Artificial intelligence
摘要: To be applicable to real world problems, much reinforcement learning (RL) research has focused on continuous state spaces with function approximations. Some problems also require actions, but searching for good actions in a action space is problematic. This paper suggests novel relevance vector sampling approach search an RL framework machines (RVM-RL). We hypothesize that each (RV) placed the modes of value approximation surface as converges. From hypothesis, we select RVs maximize estimated state-action values. report efficiency proposed by controlling simulated octopus arm RV-sampled actions.