Temporal Difference Learning

327 bytes added, 11:06, 11 March 2019
* [[Mark Winands]], [[Levente Kocsis]], [[Jos Uiterwijk]], [[Jaap van den Herik]] ('''2002'''). ''Temporal difference learning and the Neural MoveMap heuristic in the game of Lines of Action''. GAME-ON 2002 » [[Neural MoveMap Heuristic]]
* [[James Swafford]] ('''2002'''). ''Optimizing Parameter Learning using Temporal Differences''. [http://www.aaai.org/Conferences/AAAI/aaai02.php AAAI-02], Student Abstracts, [https://www.aaai.org/Papers/AAAI/2002/AAAI02-150.pdf pdf]
* [[Justin A. Boyan]] ('''2002'''). ''[https://link.springer.com/article/10.1023%2FA%3A1017936530646 Technical Update: Least-Squares Temporal Difference Learning]''. [https://en.wikipedia.org/wiki/Machine_Learning_(journal) Machine Learning], Vol. 49, [http://research.cs.rutgers.edu/~lihong/project/ahlp/boyan02least.pdf pdf]
'''2003'''
* [[Henk Mannen]] ('''2003'''). ''Learning to play chess using reinforcement learning with database games''. Master’s thesis, [http://students.uu.nl/en/hum/cognitive-artificial-intelligence Cognitive Artificial Intelligence], [https://en.wikipedia.org/wiki/Utrecht_University Utrecht University]