作者: Hitoshi Iima , Yasuaki Kuroe
DOI: 10.1007/978-3-642-10684-2_19
关键词: Artificial intelligence 、 Swarm intelligence 、 Mathematical optimization 、 Multi-swarm optimization 、 Particle swarm optimization 、 Reinforcement learning 、 Swarm behaviour 、 Computer science 、 Reinforcement learning algorithm
摘要: We recently proposed a swarm reinforcement learning algorithm based on particle optimization (PSO) in order to find optimal policies rapidly. In this algorithm, multiple agents are prepared, and they learn not only by individual but also an update procedure of PSO. procedure, state-action values updated the personal best global which found so far. paper, we direct our attention problem that overvaluing bests brings inferior performance. overvalued best, propose PSO each agent has lifespan.