Changes

Jump to: navigation, search

Temporal Difference Learning

328 bytes added, 13:59, 23 May 2021
no edit summary
* [[Marco Block-Berlitz|Marco Block]] ('''2004'''). ''Verwendung von Temporale-Differenz-Methoden im Schachmotor FUSc#''. Diplomarbeit, Betreuer: [[Raúl Rojas]], [[Free University of Berlin]], [http://page.mi.fu-berlin.de/block/Skripte/diplomarbeit.pdf pdf] (German)
* [[Jacek Mańdziuk]], [[Daniel Osman]] ('''2004'''). ''Temporal Difference Approach to Playing Give-Away Checkers''. [http://www.informatik.uni-trier.de/~ley/db/conf/icaisc/icaisc2004.html#MandziukO04 ICAISC 2004], [http://www.mini.pw.edu.pl/~mandziuk/PRACE/ICAISC04-3.pdf pdf]
* [[Kristian Kersting]], [[Mathematician#LDRaedt|Luc De Raedt]] ('''2004'''). ''[https://lirias.kuleuven.be/1992259?limo=0 Logical Markov Decision Programs and the Convergence of Logical TD(lambda)]''. [https://dblp.org/db/conf/ilp/ilp2004.html#KerstingR04 ILP 2004], [http://people.csail.mit.edu/kersting/papers/ilp04.pdf pdf]
==2005 ...==
* [[Marco Wiering]], [http://dblp.uni-trier.de/pers/hd/p/Patist:Jan_Peter Jan Peter Patist], [[Henk Mannen]] ('''2005'''). ''Learning to Play Board Games using Temporal Difference Methods''. Technical Report, [https://en.wikipedia.org/wiki/Utrecht_University Utrecht University], UU-CS-2005-048, [http://www.ai.rug.nl/~mwiering/GROUP/ARTICLES/learning_games_TR.pdf pdf]

Navigation menu