Efficient Reinforcement Learning Through Evolving Neural Network Topologies

作者: Risto Miikkulainen , Kenneth O. Stanley

DOI:

关键词:

摘要: Neuroevolution is currently the strongest method on pole-balancing benchmark reinforcement learning tasks. Although earlier studies suggested that there was an advantage in evolving network topology as well connection weights, leading neuroevolution systems evolve fixed networks. Whether structure can improve performance open question. In this article, we introduce such a system, NeuroEvolution of Augmenting Topologies (NEAT). We show when evolved (1) with principled crossover, (2) by protecting structural innovation, and (3) through incremental growth from minimal structure, significantly faster stronger than best fixed-topology methods. NEAT also shows it possible to populations increasingly large genomes, achieving highly complex solutions would otherwise be difficult optimize.

参考文章(24)
David J. Montana, Training feedforward neural networks using genetic algorithms international joint conference on artificial intelligence. pp. 762- 767 ,(1989)
Melanie Mitchell, James P Crutchfield, Rajarshi Das, Evolving Cellular Automata with Genetic Algorithms: A Review of Recent Work ,(2000)
Heinz Mühlenbein, Byoung-Tak Zhang, Evolving Optimal Neural Networks Using Genetic Algorithms with Occam's Razor. Complex Systems. ,vol. 7, ,(1993)
Julian F. Miller, Dominic Job, Vesselin K. Vassilev, Principles in the Evolutionary Design of Digital Circuits—Part II Genetic Programming and Evolvable Machines. ,vol. 1, pp. 259- 288 ,(2000) , 10.1023/A:1010066330916
Alexis P. Wieland, Evolving Controls for Unstable Systems Connectionist Models#R##N#Proceedings of the 1990 Summer School. pp. 91- 102 ,(1991) , 10.1016/B978-1-4832-1448-1.50015-9
Frederic Gruau, Darrell Whitley, Larry Pyeatt, None, A comparison between cellular encoding and direct encoding for genetic neural networks Proceedings of the 1st annual conference on genetic programming. pp. 81- 89 ,(1996)
Samir W. Mahfoud, Niching methods for genetic algorithms University of Illinois at Urbana-Champaign. ,(1996)
Leonid Peshkin, Leslie Pack Kaelbling, Kee-Eung Kim, Nicolas Meuleau, Learning finite-state controllers for partially observable environments uncertainty in artificial intelligence. pp. 427- 436 ,(1999)
J.D. Schaffer, D. Whitley, L.J. Eshelman, Combinations of genetic algorithms and neural networks: a survey of the state of the art [Proceedings] COGANN-92: International Workshop on Combinations of Genetic Algorithms and Neural Networks. pp. 1- 37 ,(1992) , 10.1109/COGANN.1992.273950