Three-Head Neural Network Architecture for AlphaZero Learning

Authors: Hengshuai Yao, Ryan Hayward, Chao Gao, Martin Mueller, Shangling Jui

Abstract: The search-based reinforcement learning algorithm AlphaZero has been used as a general method for mastering the two-player games Go, chess, and Shogi. One crucial ingredient in …
