Three-Head Neural Network Architecture for Monte Carlo Tree Search.

作者: Chao Gao , Martin Müller , Ryan Hayward

DOI: 10.24963/IJCAI.2018/523

关键词:

摘要:

参考文章(12)
Diederik P. Kingma, Jimmy Ba, Adam: A Method for Stochastic Optimization arXiv: Learning. ,(2014)
Ilya Sutskever, Chris J. Maddison, Aja Huang, David Silver, Move Evaluation in Go Using Deep Convolutional Neural Networks international conference on learning representations. ,(2015)
Sylvain Gelly, David Silver, Monte-Carlo tree search and rapid action value estimation in computer Go Artificial Intelligence. ,vol. 175, pp. 1856- 1875 ,(2011) , 10.1016/J.ARTINT.2011.03.007
Markus Enzenberger, Martin Muller, Broderick Arneson, Richard Segal, Fuego—An Open-Source Framework for Board Games and Go Engine Based on Monte Carlo Tree Search IEEE Transactions on Computational Intelligence and AI in Games. ,vol. 2, pp. 259- 270 ,(2010) , 10.1109/TCIAIG.2010.2083662
Broderick Arneson, Ryan B. Hayward, Philip Henderson, Monte Carlo Tree Search in Hex IEEE Transactions on Computational Intelligence and AI in Games. ,vol. 2, pp. 251- 258 ,(2010) , 10.1109/TCIAIG.2010.2067212
Amos Storkey, Christopher Clark, Training Deep Convolutional Neural Networks to Play Go international conference on machine learning. pp. 1766- 1774 ,(2015)
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, Demis Hassabis, None, Human-level control through deep reinforcement learning Nature. ,vol. 518, pp. 529- 533 ,(2015) , 10.1038/NATURE14236
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun, Identity Mappings in Deep Residual Networks Computer Vision – ECCV 2016. pp. 630- 645 ,(2016) , 10.1007/978-3-319-46493-0_38
Norman P. Jouppi, Cliff Young, Nishant Patil, David Patterson, Gaurav Agrawal, Raminder Bajwa, Sarah Bates, Suresh Bhatia, Nan Boden, Al Borchers, Rick Boyle, Pierre-luc Cantin, Clifford Chao, Chris Clark, Jeremy Coriell, Mike Daley, Matt Dau, Jeffrey Dean, Ben Gelb, Tara Vazir Ghaemmaghami, Rajendra Gottipati, William Gulland, Robert Hagmann, C. Richard Ho, Doug Hogberg, John Hu, Robert Hundt, Dan Hurt, Julian Ibarz, Aaron Jaffey, Alek Jaworski, Alexander Kaplan, Harshit Khaitan, Daniel Killebrew, Andy Koch, Naveen Kumar, Steve Lacy, James Laudon, James Law, Diemthu Le, Chris Leary, Zhuyuan Liu, Kyle Lucke, Alan Lundin, Gordon MacKean, Adriana Maggiore, Maire Mahony, Kieran Miller, Rahul Nagarajan, Ravi Narayanaswami, Ray Ni, Kathy Nix, Thomas Norrie, Mark Omernick, Narayana Penukonda, Andy Phelps, Jonathan Ross, Matt Ross, Amir Salek, Emad Samadiani, Chris Severn, Gregory Sizikov, Matthew Snelham, Jed Souter, Dan Steinberg, Andy Swing, Mercedes Tan, Gregory Thorson, Bo Tian, Horia Toma, Erick Tuttle, Vijay Vasudevan, Richard Walter, Walter Wang, Eric Wilcox, Doe Hyun Yoon, In-Datacenter Performance Analysis of a Tensor Processing Unit international symposium on computer architecture. ,vol. 45, pp. 1- 12 ,(2017) , 10.1145/3079856.3080246
Chao Gao, Ryan Hayward, Martin Muller, Move Prediction Using Deep Convolutional Neural Networks in Hex IEEE Transactions on Games. ,vol. 10, pp. 336- 343 ,(2018) , 10.1109/TG.2017.2785042