Changes

Jump to: navigation, search

David Silver

21 bytes added, 16:58, 4 July 2020
no edit summary
* [[Johannes Heinrich]], [[Marc Lanctot]], [[David Silver]] ('''2015'''). ''Fictitious Self-Play in Extensive-Form Games''. [http://proceedings.mlr.press/v37/ JMLR: W&CP, Vol. 37], [http://proceedings.mlr.press/v37/heinrich15.pdf pdf]
* [[Johannes Heinrich]], [[David Silver]] ('''2015'''). ''Smooth UCT Search in Computer Poker''. [[Conferences#IJCA2015|IJCAI 2015]], [http://www0.cs.ucl.ac.uk/staff/d.silver/web/Publications_files/smooth_uct.pdf pdf]
* [[Volodymyr Mnih]], [[Koray Kavukcuoglu]], [[David Silver]], [[Mathematician#AARusu|Andrei A. Rusu]], [[Joel Veness]], [[Marc G. Bellemare]], [[Alex Graves]], [[Martin Riedmiller]], [[Andreas K. Fidjeland]], [[Georg Ostrovski]], [[Stig Petersen]], [[Charles Beattie]], [[Amir Sadik]], [[Ioannis Antonoglou]], [[Helen King]], [[Dharshan Kumaran]], [[Daan Wierstra]], [[Shane Legg]], [[Demis Hassabis]] ('''2015'''). ''[http://www.nature.com/nature/journal/v518/n7540/abs/nature14236.html Human-level control through deep reinforcement learning]''. [https://en.wikipedia.org/wiki/Nature_%28journal%29 Nature], Vol. 518
* [[Arun Nair]], [[Praveen Srinivasan]], [[Sam Blackwell]], [[Cagdas Alcicek]], [[Rory Fearon]], [[Alessandro De Maria]], [[Veda Panneershelvam]], [[Mustafa Suleyman]], [[Charles Beattie]], [[Stig Petersen]], [[Shane Legg]], [[Volodymyr Mnih]], [[Koray Kavukcuoglu]], [[David Silver]] ('''2015'''). ''Massively Parallel Methods for Deep Reinforcement Learning''. [http://arxiv.org/abs/1507.04296 arXiv:1507.04296]
* [[Timothy Lillicrap]], [[Jonathan J. Hunt]], [[Alexander Pritzel]], [[Nicolas Heess]], [[Tom Erez]], [[Yuval Tassa]], [[David Silver]], [[Daan Wierstra]] ('''2015'''). ''Continuous Control with Deep Reinforcement Learning''. [https://arxiv.org/abs/1509.02971 arXiv:1509.02971]

Navigation menu