Difference between revisions of "David Silver"

From Chessprogramming wiki
Jump to: navigation, search
Line 50: Line 50:
 
* [[Johannes Heinrich]], [[Marc Lanctot]], [[David Silver]] ('''2015'''). ''Fictitious Self-Play in Extensive-Form Games''. [http://proceedings.mlr.press/v37/ JMLR: W&CP, Vol. 37], [http://proceedings.mlr.press/v37/heinrich15.pdf pdf]
 
* [[Johannes Heinrich]], [[Marc Lanctot]], [[David Silver]] ('''2015'''). ''Fictitious Self-Play in Extensive-Form Games''. [http://proceedings.mlr.press/v37/ JMLR: W&CP, Vol. 37], [http://proceedings.mlr.press/v37/heinrich15.pdf pdf]
 
* [[Johannes Heinrich]], [[David Silver]] ('''2015'''). ''Smooth UCT Search in Computer Poker''. [[Conferences#IJCA2015|IJCAI 2015]], [http://www0.cs.ucl.ac.uk/staff/d.silver/web/Publications_files/smooth_uct.pdf pdf]
 
* [[Johannes Heinrich]], [[David Silver]] ('''2015'''). ''Smooth UCT Search in Computer Poker''. [[Conferences#IJCA2015|IJCAI 2015]], [http://www0.cs.ucl.ac.uk/staff/d.silver/web/Publications_files/smooth_uct.pdf pdf]
* [[Volodymyr Mnih]], [[Koray Kavukcuoglu]], [[David Silver]], [[Andrei A. Rusu]], [[Joel Veness]], [[Marc G. Bellemare]], [[Alex Graves]], [[Martin Riedmiller]], [[Andreas K. Fidjeland]], [[Georg Ostrovski]], [[Stig Petersen]], [[Charles Beattie]], [[Amir Sadik]], [[Ioannis Antonoglou]], [[Helen King]], [[Dharshan Kumaran]], [[Daan Wierstra]], [[Shane Legg]], [[Demis Hassabis]] ('''2015'''). ''[http://www.nature.com/nature/journal/v518/n7540/abs/nature14236.html Human-level control through deep reinforcement learning]''. [https://en.wikipedia.org/wiki/Nature_%28journal%29 Nature], Vol. 518
+
* [[Volodymyr Mnih]], [[Koray Kavukcuoglu]], [[David Silver]], [[Mathematician#AARusu|Andrei A. Rusu]], [[Joel Veness]], [[Marc G. Bellemare]], [[Alex Graves]], [[Martin Riedmiller]], [[Andreas K. Fidjeland]], [[Georg Ostrovski]], [[Stig Petersen]], [[Charles Beattie]], [[Amir Sadik]], [[Ioannis Antonoglou]], [[Helen King]], [[Dharshan Kumaran]], [[Daan Wierstra]], [[Shane Legg]], [[Demis Hassabis]] ('''2015'''). ''[http://www.nature.com/nature/journal/v518/n7540/abs/nature14236.html Human-level control through deep reinforcement learning]''. [https://en.wikipedia.org/wiki/Nature_%28journal%29 Nature], Vol. 518
 
* [[Arun Nair]], [[Praveen Srinivasan]], [[Sam Blackwell]], [[Cagdas Alcicek]], [[Rory Fearon]], [[Alessandro De Maria]], [[Veda Panneershelvam]], [[Mustafa Suleyman]], [[Charles Beattie]], [[Stig Petersen]], [[Shane Legg]], [[Volodymyr Mnih]], [[Koray Kavukcuoglu]], [[David Silver]] ('''2015'''). ''Massively Parallel Methods for Deep Reinforcement Learning''. [http://arxiv.org/abs/1507.04296 arXiv:1507.04296]
 
* [[Arun Nair]], [[Praveen Srinivasan]], [[Sam Blackwell]], [[Cagdas Alcicek]], [[Rory Fearon]], [[Alessandro De Maria]], [[Veda Panneershelvam]], [[Mustafa Suleyman]], [[Charles Beattie]], [[Stig Petersen]], [[Shane Legg]], [[Volodymyr Mnih]], [[Koray Kavukcuoglu]], [[David Silver]] ('''2015'''). ''Massively Parallel Methods for Deep Reinforcement Learning''. [http://arxiv.org/abs/1507.04296 arXiv:1507.04296]
 
* [[Timothy Lillicrap]], [[Jonathan J. Hunt]], [[Alexander Pritzel]], [[Nicolas Heess]], [[Tom Erez]], [[Yuval Tassa]], [[David Silver]], [[Daan Wierstra]] ('''2015'''). ''Continuous Control with Deep Reinforcement Learning''. [https://arxiv.org/abs/1509.02971 arXiv:1509.02971]
 
* [[Timothy Lillicrap]], [[Jonathan J. Hunt]], [[Alexander Pritzel]], [[Nicolas Heess]], [[Tom Erez]], [[Yuval Tassa]], [[David Silver]], [[Daan Wierstra]] ('''2015'''). ''Continuous Control with Deep Reinforcement Learning''. [https://arxiv.org/abs/1509.02971 arXiv:1509.02971]

Revision as of 16:58, 4 July 2020

Home * People * David Silver

David Silver [1]

David Silver,
a British computer scientist at Google DeepMind, and co-author of AlphaGo and AlphaZero. Before, since 2010, he was researcher at University College London, postdoc at Massachusetts Institute of Technology [2], Ph.D student and postdoc at University of Alberta, and CTO for Elixir Studios and lead programmer on the PC strategy game Republic: the Revolution [3]. His research interests covers simulation-based search, reinforcement learning, and cooperative pathfinding.

See also

Selected Publications

[4] [5] [6]

2006 ...

  • David Silver (2006). Cooperative Pathfinding. In AI Game Programming Wisdom 3, pages 99–111. Charles River Media, pdf

2007

2008

2009

2010 ...

2011

2012

2013

2014

2015 ...

2016

2017

2018

2019

External Links

AlphaGo Zero: Discovering new knowledge by David Silver, YouTube Video

References

Up one level