Changes

← Older edit

David Silver

17 bytes removed, 11:24, 16 April 2021

no edit summary

'''2009'''

* [[David Silver]], [[Gerald Tesauro]] ('''2009'''). ''Monte-Carlo Simulation Balancing''. In Proceedings of the 26th International Conference on Machine Learning (ICML-09).

* ~~[[Richard Sutton]],~~ [[Hamid Reza Maei]], [[~~Doina Precup~~Csaba Szepesvári]], [[Shalabh Bhatnagar]], [[~~David Silver~~Doina Precup]], [[~~Csaba Szepesvári~~David Silver]], [[~~Eric Wiewiora~~Richard Sutton]]. ('''2009'''). ''~~Fast Gradient-Descent Methods for~~ Convergent Temporal-Difference Learning with ~~Linear~~ Arbitrary Smooth Function Approximation''. ~~In Proceedings of the 26th International Conference on Machine Learning (ICML~~[https://dblp.uni-~~09)~~trier.de/db/conf/nips/nips2009. html#MaeiSBPSS09 NIPS 2009], [~~http~~https://~~www~~papers.~~sztaki~~nips.hucc/~~~szcsaba~~paper/~~papers~~2009/file/~~GTD~~3a15c7d0bbe60300a39f76f8a5ba6896-~~ICML09~~Paper.pdf pdf]* [[Richard Sutton]], [[Hamid Reza Maei]], [[~~Csaba Szepesvári~~Doina Precup]], [[Shalabh Bhatnagar]], [[~~Doina Precup~~David Silver]], [[~~David Silver~~Csaba Szepesvári]], [[~~Richard Sutton~~Eric Wiewiora]] . ('''2009'''). ''~~Convergent~~ [https://dl.acm.org/doi/10.1145/1553374.1553501 Fast Gradient-Descent Methods for Temporal-Difference Learning with ~~Arbitrary Smooth~~ Linear Function Approximation.]'' ~~Accepted in Advances in Neural Information Processing Systems 22, Vancouver, BC. December 2009. MIT Press~~. [~~http~~https://~~books~~dblp.~~nips~~uni-trier.ccde/~~papers~~db/~~files~~conf/~~nips22~~icml/~~NIPS2009_1121~~icml2009.~~pdf pdf~~html#SuttonMPBSSW09 ICML 2009]

* [[Joel Veness]], [[David Silver]], [[William Uther]], [[Alan Blair]] ('''2009'''). ''[http://papers.nips.cc/paper/3722-bootstrapping-from-game-tree-search Bootstrapping from Game Tree Search]''. [http://webdocs.cs.ualberta.ca/~silver/David_Silver/Applications_files/bootstrapping.pdf pdf]

* [[David Silver]] ('''2009'''). ''Reinforcement Learning and Simulation-Based Search''. Ph.D. thesis, [[University of Alberta]], [http://www0.cs.ucl.ac.uk/staff/D.Silver/web/Applications_files/thesis.pdf pdf]

* [[Julian Schrittwieser]], [[Ioannis Antonoglou]], [[Thomas Hubert]], [[Karen Simonyan]], [[Laurent Sifre]], [[Simon Schmitt]], [[Arthur Guez]], [[Edward Lockhart]], [[Demis Hassabis]], [[Thore Graepel]], [[Timothy Lillicrap]], [[David Silver]] ('''2019'''). ''Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model''. [https://arxiv.org/abs/1911.08265 arXiv:1911.08265]

==2020 ...==

* [[Julian Schrittwieser]], [[Ioannis Antonoglou]], [[Thomas Hubert]], [[Karen Simonyan]], [[Laurent Sifre]], [[Simon Schmitt]], [[Arthur Guez]], [[Edward ~~Lockhar~~Lockhart]], [[Demis Hassabis]], [[Thore Graepel]], [[Timothy Lillicrap]], [[David Silver]] ('''2020'''). ''[https://www.nature.com/articles/s41586-020-03051-4 Mastering Atari, Go, chess and shogi by planning with a learned model]''. [https://en.wikipedia.org/wiki/Nature_%28journal%29 Nature], Vol. 588 <ref>[https://deepmind.com/blog/article/muzero-mastering-go-chess-shogi-and-atari-without-rules?fbclid=IwAR3mSwrn1YXDKr9uuGm2GlFKh76wBilex7f8QvBiQecwiVmAvD6Bkyjx-rE MuZero: Mastering Go, chess, shogi and Atari without rules]</ref>

=External Links=

GerdIsenberg

Bureaucrats, Administrators

25,161

edits

Changes

David Silver

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools