Changes

Jump to: navigation, search

Temporal Difference Learning

992 bytes removed, 09:17, 9 June 2018
no edit summary
* [[Daniel Osman]] ('''2007'''). ''Temporal Difference Methods for Two-player Board Games''. Ph.D. thesis, Faculty of Mathematics and Information Science, [https://en.wikipedia.org/wiki/Warsaw_University_of_Technology Warsaw University of Technology]
* [[Yasuhiro Osaki]], [[Kazutomo Shibahara]], [[Yasuhiro Tajima]], [[Yoshiyuki Kotani]] ('''2007'''). ''Reinforcement Learning of Evaluation Functions Using Temporal Difference-Monte Carlo learning method''. [[Conferences#GPW|12th Game Programming Workshop]]
* [[Andrew Barto]] ('''2007'''). ''[http://www.scholarpedia.org/article/Temporal_difference_learning Temporal difference learning]''. [https://en.wikipedia.org/wiki/Scholarpedia Scholarpedia] 2(11):1604
'''2008'''
* [[Yasuhiro Osaki]], [[Kazutomo Shibahara]], [[Yasuhiro Tajima]], [[Yoshiyuki Kotani]] ('''2008'''). ''An Othello Evaluation Function Based on Temporal Difference Learning using Probability of Winning''. [http://www.csse.uwa.edu.au/cig08/Proceedings/toc.html CIG'08], [http://www.csse.uwa.edu.au/cig08/Proceedings/papers/8010.pdf pdf]
: [https://en.wiktionary.org/wiki/temporal temporal - Wiktionary]
* [https://en.wikipedia.org/wiki/Reinforcement_learning#Temporal_difference_methods Reinforcement learning - Temporal difference methods from Wikipedia]
* [http://www.scholarpedia.org/article/Temporal_difference_learning Temporal difference learning - Scholarpedia]
* [https://webdocs.cs.ualberta.ca/~sutton/book/ebook/node60.html 6. Temporal-Difference Learning] in [[Richard Sutton]], [[Andrew Barto]] ('''1998'''). ''[http://www.incompleteideas.net/sutton/book/the-book.html Reinforcement Learning: An Introduction]''. [https://en.wikipedia.org/wiki/MIT_Press MIT Press] eBook
* [https://web.stanford.edu/group/pdplab/pdphandbook/handbookch10.html Temporal-Difference Learning] (Chapter 9) in [[James L. McClelland]] ('''2015'''). ''[https://web.stanford.edu/group/pdplab/pdphandbook/handbook3.html#handbookch10.html Explorations in Parallel Distributed Processing: A Handbook of Models, Programs, and Exercises]''. Second Edition, [https://web.stanford.edu/group/pdplab/pdphandbook/handbookli1.html Contents]
* [https://www.tu-chemnitz.de/informatik/KI/scripts/ws0910/ml09_6.pdf Temporal-Difference learning], Slides as pdf from [[Chemnitz University of Technology]]
* [http://www.bcp.psych.ualberta.ca/~mike/Pearl_Street/Dictionary/contents/C/creditassign.html University of Alberta Dictionary of Cognitive Science: Credit Assignment Problem]
* [[Videos#ShawnLane|Shawn Lane]], [[Videos#JonasHellborg|Jonas Hellborg]], [[Videos#JeffSipe|Jeff Sipe]] - [https://en.wikipedia.org/wiki/Temporal_Analogues_of_Paradise Temporal Analogues of Paradise - 2nd Movement], [https://en.wikipedia.org/wiki/Atlanta Atlanta] Drums & Percussion, August 20, 1996, [https://en.wikipedia.org/wiki/YouTube YouTube] Video
: {{#evu:https://www.youtube.com/watch?v=oJ_MfyrQT8I|alignment=left|valignment=top}}

Navigation menu