Difference between revisions of "David Silver"

From Chessprogramming wiki
 
 
* [[Sylvain Gelly]], [[David Silver]] ('''2007'''). ''Combining Online and Offline Knowledge in UCT.'' [http://www.machinelearning.org/proceedings/icml2007/papers/387.pdf pdf]
 
 
'''2008'''
 
* [[David Silver]], [[Richard Sutton]], [[Martin Müller]] ('''2008'''). ''Sample-Based Learning and Search with Permanent and Transient Memories''. In Proceedings of the 25th International Conference on Machine Learning, [http://icml2008.cs.helsinki.fi/papers/564.pdf pdf]
 
* [[Sylvain Gelly]], [[David Silver]] ('''2008'''). ''Achieving Master Level Play in 9 x 9 Computer Go.'' [http://www.lri.fr/~gelly/paper/MoGoNectar.pdf pdf]
 
 
'''2009'''
 
 
* [[David Silver]], [[Gerald Tesauro]] ('''2009'''). ''Monte-Carlo Simulation Balancing''. In Proceedings of the 26th International Conference on Machine Learning (ICML-09).  
 
* [[Richard Sutton]], [[Hamid Reza Maei]], [[Doina Precup]], [[Shalabh Bhatnagar]], [[David Silver]], [[Csaba Szepesvári]], [[Eric Wiewiora]] ('''2009'''). ''Fast Gradient-Descent Methods for Temporal-Difference Learning with Linear Function Approximation''. In Proceedings of the 26th International Conference on Machine Learning (ICML-09). [http://www.sztaki.hu/~szcsaba/papers/GTD-ICML09.pdf pdf]
 
* [[Hamid Reza Maei]], [[Csaba Szepesvári]], [[Shalabh Bhatnagar]], [[Doina Precup]], [[David Silver]], [[Richard Sutton]] ('''2009'''). ''Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation.'' Accepted in Advances in Neural Information Processing Systems 22, Vancouver, BC. December 2009. MIT Press. [http://books.nips.cc/papers/files/nips22/NIPS2009_1121.pdf pdf]
 
 
* [[Joel Veness]], [[David Silver]], [[William Uther]], [[Alan Blair]] ('''2009'''). ''[http://papers.nips.cc/paper/3722-bootstrapping-from-game-tree-search Bootstrapping from Game Tree Search]''. [http://webdocs.cs.ualberta.ca/~silver/David_Silver/Applications_files/bootstrapping.pdf pdf]
 

Revision as of 20:59, 31 May 2018


David Silver [1]

David Silver,
a British computer scientist at Google DeepMind and co-author of AlphaGo and AlphaZero. Previously he was a researcher at University College London (since 2010), a postdoc at Massachusetts Institute of Technology [2], a Ph.D. student and postdoc at the University of Alberta, and CTO of Elixir Studios and lead programmer on the PC strategy game Republic: The Revolution [3]. His research interests cover simulation-based search, reinforcement learning, and cooperative pathfinding.

See also

Selected Publications

[4] [5] [6]

2006 ...

  • David Silver (2006). Cooperative Pathfinding. In AI Game Programming Wisdom 3, pages 99–111. Charles River Media, pdf

2007

2008

2009

2010 ...

2011

2012

2013

2014

2015 ...

2016

2017

External Links

AlphaGo Zero: Discovering new knowledge by David Silver, YouTube Video

References
