Changes

Jump to: navigation, search

Deep Learning

551 bytes added, 21:43, 25 September 2018
no edit summary
* [[David Silver]], [[Thomas Hubert]], [[Julian Schrittwieser]], [[Ioannis Antonoglou]], [[Matthew Lai]], [[Arthur Guez]], [[Marc Lanctot]], [[Laurent Sifre]], [[Dharshan Kumaran]], [[Thore Graepel]], [[Timothy Lillicrap]], [[Karen Simonyan]], [[Demis Hassabis]] ('''2017'''). ''Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm''. [https://arxiv.org/abs/1712.01815 arXiv:1712.01815] » [[AlphaZero]]
* [[Shantanu Thakoor]], [[Surag Nair]], [[Megha Jhunjhunwala]] ('''2017'''). ''Learning to Play Othello Without Human Knowledge''. [[Stanford University]], [https://github.com/suragnair/alpha-zero-general/blob/master/pretrained_models/writeup.pdf pdf] » [[AlphaZero]], [[Monte-Carlo Tree Search|MCTS]], [[Othello]] <ref>[https://github.com/suragnair/alpha-zero-general GitHub - suragnair/alpha-zero-general: A clean and simple implementation of a self-play learning algorithm based on AlphaGo Zero (any game, any framework!)]</ref>
* [[Thomas Anthony]], [[Zheng Tian]], [[David Barber]] ('''2017'''). ''Thinking Fast and Slow with Deep Learning and Tree Search''. [https://arxiv.org/abs/1705.08439 arXiv:1705.08439]
'''2018'''
* [[Matthia Sabatelli]], [[Francesco Bidoia]], [[Valeriu Codreanu]], [[Marco Wiering]] ('''2018'''). ''Learning to Evaluate Chess Positions with Deep Neural Networks and Limited Lookahead''. ICPRAM 2018, [http://www.ai.rug.nl/~mwiering/GROUP/ARTICLES/ICPRAM_CHESS_DNN_2018.pdf pdf]
* [[Sai Krishna G.V.]], [[Kyle Goyette]],, [[Ahmad Chamseddine]],, [[Breandan Considine]] ('''2018'''). ''Deep Pepper: Expert Iteration based Chess agent in the Reinforcement Learning Setting''. [https://arxiv.org/abs/1806.00683 arXiv:1806.00683] <ref>[http://www.talkchess.com/forum3/viewtopic.php?f=2&t=67923 Deep Pepper Paper] by Leo, [[CCC]], July 07, 2018</ref>
=Forum Posts=

Navigation menu