Authors: Hengshuai Yao, Ryan Hayward, Chao Gao, Martin Mueller, Shangling Jui
DOI:
Keywords:
Abstract: The search-based reinforcement learning algorithm AlphaZero has been used as a general method for mastering the two-player games Go, chess, and shogi. One crucial ingredient in …