Changes


Deep Learning

176 bytes added, 19:25, 22 October 2018
no edit summary
* [[Matej Moravčík]], [[Martin Schmid]], [[Neil Burch]], [[Viliam Lisý]], [[Dustin Morrill]], [[Nolan Bard]], [[Trevor Davis]], [[Kevin Waugh]], [[Michael Johanson]], [[Michael Bowling]] ('''2017'''). ''[http://science.sciencemag.org/content/356/6337/508 DeepStack: Expert-level artificial intelligence in heads-up no-limit poker]''. [https://en.wikipedia.org/wiki/Science_(journal) Science], Vol. 356, No. 6337
* [[Tristan Cazenave]] ('''2017'''). ''Improved Policy Networks for Computer Go''. [[Advances in Computer Games 15]], [http://www.lamsade.dauphine.fr/~cazenave/papers/gofairsbn.pdf pdf]
* [[Hirotaka Kameko]], [[Jun Suzuki]], [[Naoki Mizukami]], [[Yoshimasa Tsuruoka]] ('''2017'''). ''Deep Reinforcement Learning with Hidden Layers on Future States''. [[Conferences#IJCAI2017|CGW@IJCAI 2017]], [http://www.lamsade.dauphine.fr/~cazenave/cgw2017/Kameko.pdf pdf]
* [[Keigo Kawamura]], [[Naoki Mizukami]], [[Yoshimasa Tsuruoka]] ('''2017'''). ''Neural Fictitious Self-Play in Imperfect Information Games with Many Players''. [[Conferences#IJCAI2017|CGW@IJCAI 2017]], [http://www.lamsade.dauphine.fr/~cazenave/cgw2017/Kawamura.pdf pdf]
* [[Thomas Philip Runarsson]] ('''2017'''). ''[https://link.springer.com/chapter/10.1007/978-3-319-75931-9_3 Deep Preference Neural Network for Move Prediction in Board Games]''. [[Conferences#IJCAI2017|CGW@IJCAI 2017]]
* [[Marc Lanctot]], [[Vinícius Flores Zambaldi]], [[Audrunas Gruslys]], [[Angeliki Lazaridou]], [[Karl Tuyls]], [[Julien Pérolat]], [[David Silver]], [[Thore Graepel]] ('''2017'''). ''A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning''. [https://arxiv.org/abs/1711.00832 arXiv:1711.00832]
* [[David Silver]], [[Julian Schrittwieser]], [[Karen Simonyan]], [[Ioannis Antonoglou]], [[Shih-Chieh Huang|Aja Huang]], [[Arthur Guez]], [[Thomas Hubert]], [[Lucas Baker]], [[Matthew Lai]], [[Adrian Bolton]], [[Yutian Chen]], [[Timothy Lillicrap]], [[Fan Hui]], [[Laurent Sifre]], [[George van den Driessche]], [[Thore Graepel]], [[Demis Hassabis]] ('''2017'''). ''[https://www.nature.com/nature/journal/v550/n7676/full/nature24270.html Mastering the game of Go without human knowledge]''. [https://en.wikipedia.org/wiki/Nature_%28journal%29 Nature], Vol. 550, [https://www.gwern.net/docs/rl/2017-silver.pdf pdf] <ref>[https://deepmind.com/blog/alphago-zero-learning-scratch/ AlphaGo Zero: Learning from scratch] by [[Demis Hassabis]] and [[David Silver]], [[DeepMind]], October 18, 2017</ref>
