Authors: D.E. Koulouriotis, A. Xanthopoulos
DOI: 10.1016/J.AMC.2007.07.043
Keywords: Genetic algorithm, Softmax function, Thompson sampling, Artificial intelligence, Action selection, Mathematics, Reinforcement learning, Multi-armed bandit, Probability matching, Evolutionary algorithm
Abstract: … itself in the face of evolutionary algorithms. We present an evolutionary algorithm that was implemented to solve the non-stationary bandit problem along with ad hoc solution algorithms, …