Changes

Jump to: navigation, search

David Silver

6 bytes removed, 20:59, 31 May 2018
no edit summary
* [[Sylvain Gelly]], [[David Silver]] ('''2007'''). ''Combining Online and Offline Knowledge in UCT.'' [http://www.machinelearning.org/proceedings/icml2007/papers/387.pdf pdf]
'''2008'''
* [[David Silver]], [[Richard Sutton]] and , [[Martin Müller]] ('''2008'''). ''Sample-Based Learning and Search with Permanent and Transient Memories''. In Proceedings of the 25th International Conference on Machine Learning, [http://icml2008.cs.helsinki.fi/papers/564.pdf pdf]
* [[Sylvain Gelly]], [[David Silver]] ('''2008'''). ''Achieving Master Level Play in 9 x 9 Computer Go.'' [http://www.lri.fr/~gelly/paper/MoGoNectar.pdf pdf]
'''2009'''
* [[David Silver]], [[Gerald Tesauro]] ('''2009'''). ''Monte-Carlo Simulation Balancing''. In Proceedings of the 26th International Conference on Machine Learning (ICML-09).
* [[Richard Sutton]], [[Hamid Reza Maei]], [[Doina Precup]], [[Shalabh Bhatnagar]], [[David Silver]], [[Csaba Szepesvári]] and , [[Eric Wiewiora]]. ('''2009'''). ''Fast Gradient-Descent Methods for Temporal-Difference Learning with Linear Function Approximation''. In Proceedings of the 26th International Conference on Machine Learning (ICML-09). [http://www.sztaki.hu/~szcsaba/papers/GTD-ICML09.pdf pdf]
* [[Hamid Reza Maei]], [[Csaba Szepesvári]], [[Shalabh Bhatnagar]], [[Doina Precup]], [[David Silver]], [[Richard Sutton]] ('''2009'''). ''Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation.'' Accepted in Advances in Neural Information Processing Systems 22, Vancouver, BC. December 2009. MIT Press. [http://books.nips.cc/papers/files/nips22/NIPS2009_1121.pdf pdf]
* [[Joel Veness]], [[David Silver]], [[William Uther]], [[Alan Blair]] ('''2009'''). ''[http://papers.nips.cc/paper/3722-bootstrapping-from-game-tree-search Bootstrapping from Game Tree Search]''. [http://webdocs.cs.ualberta.ca/~silver/David_Silver/Applications_files/bootstrapping.pdf pdf]

Navigation menu