Changes

UCT

256 bytes added, 18:08, 7 March 2019
* [[Damien Pellier]], [[Bruno Bouzy]], [[Marc Métivier]] ('''2010'''). ''An UCT Approach for Anytime Agent-based Planning''. [http://www.springer.com/de/book/9783642123832 PAAMS'10], [http://www.mi.parisdescartes.fr/~bouzy/publications/pellier-bouzy-metivier-paams10.pdf pdf]
* [[Thomas J. Walsh]], [[Sergiu Goschin]], [[Michael L. Littman]] ('''2010'''). ''Integrating sample-based planning and model-based reinforcement learning''. [[AAAI]], [https://pdfs.semanticscholar.org/bdc9/bfb6ecc6fb5afb684df03d7220c46ebdbf4e.pdf pdf] » [[Reinforcement Learning]]
* [[Takayuki Yajima]], [[Tsuyoshi Hashimoto]], [[Toshiki Matsui]], [[Junichi Hashimoto]], [[Kristian Spoerer]] ('''2010'''). ''[https://link.springer.com/chapter/10.1007%2F978-3-642-17928-0_11 Node-Expansion Operators for the UCT Algorithm]''. [[CG 2010]]
'''2011'''
* [[Junichi Hashimoto]], [[Akihiro Kishimoto]], [[Kazuki Yoshizoe]], [[Kokolo Ikeda]] ('''2011'''). ''[http://link.springer.com/chapter/10.1007/978-3-642-31866-5_1 Accelerated UCT and Its Application to Two-Player Games]''. [[Advances in Computer Games 13]]