Changes

Learning

245 bytes added, 15:41, 4 March 2019
* [[Hado van Hasselt]], [[Arthur Guez]], [[David Silver]] ('''2015'''). ''Deep Reinforcement Learning with Double Q-learning''. [http://arxiv.org/abs/1509.06461 arXiv:1509.06461]
* [[Tom Schaul]], [[John Quan]], [[Ioannis Antonoglou]], [[David Silver]] ('''2015'''). ''Prioritized Experience Replay''. [http://arxiv.org/abs/1511.05952 arXiv:1511.05952]
* [[Miroslav Kubat]] ('''2015'''). ''[https://www.springer.com/us/book/9783319348865#otherversion=9783319200095 An Introduction to Machine Learning]''. [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer]
* [[Christian Wirth]], [[Johannes Fürnkranz]] ('''2015'''). ''[http://ieeexplore.ieee.org/document/6861960/ On Learning From Game Annotations]''. [[IEEE#TOCIAIGAMES|IEEE Transactions on Computational Intelligence and AI in Games]], Vol. 7, No. 3
'''2016'''
* [[Johannes Fürnkranz]] ('''2017'''). ''Machine Learning and Game Playing''. in [https://en.wikipedia.org/wiki/Claude_Sammut Claude Sammut], [https://en.wikipedia.org/wiki/Geoff_Webb Geoffrey I. Webb] (eds) ('''2017'''). ''[https://link.springer.com/referencework/10.1007%2F978-1-4899-7687-1 Encyclopedia of Machine Learning and Data Mining]''. [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer], [https://en.wikipedia.org/wiki/Boston Boston, MA]
* [[David Silver]], [[Thomas Hubert]], [[Julian Schrittwieser]], [[Ioannis Antonoglou]], [[Matthew Lai]], [[Arthur Guez]], [[Marc Lanctot]], [[Laurent Sifre]], [[Dharshan Kumaran]], [[Thore Graepel]], [[Timothy Lillicrap]], [[Karen Simonyan]], [[Demis Hassabis]] ('''2017'''). ''Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm''. [https://arxiv.org/abs/1712.01815 arXiv:1712.01815] » [[AlphaZero]]
* [[Miroslav Kubat]] ('''2017'''). ''[https://www.springer.com/gp/book/9783319639123 An Introduction to Machine Learning]''. Second Edition, [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer]
'''2018'''
* [[Arthur Guez]], [[Théophane Weber]], [[Ioannis Antonoglou]], [[Karen Simonyan]], [[Oriol Vinyals]], [[Daan Wierstra]], [[Rémi Munos]], [[David Silver]] ('''2018'''). ''Learning to Search with MCTSnets''. [https://arxiv.org/abs/1802.04697 arXiv:1802.04697] » [[Monte-Carlo Tree Search]]
