Changes

Marco Wiering

73 bytes added, 09:49, 15 April 2021

no edit summary

* [[Marco Wiering]] ('''1999'''). ''Explorations in Efficient Reinforcement Learning''. Ph.D. thesis, [https://en.wikipedia.org/wiki/University_of_Amsterdam University of Amsterdam], advisors [[Mathematician#FGroen|Frans Groen]] and [[Jürgen Schmidhuber]]

==2000 ...==

* ~~[[Marco Wiering]] ('''2000'''). ''Multi-agent reinforcement learning for traffic light control''. ICML, [http://www.dcsc.tudelft.nl/~sc4081/assign/pap/Reinforcement_Learning.pdf pdf]~~

* [[Henk Mannen]], [[Marco Wiering]] ('''2004'''). ''[https://www.semanticscholar.org/paper/Learning-to-Play-Chess-using-TD(lambda)-learning-Mannen-Wiering/00a6f81c8ebe8408c147841f26ed27eb13fb07f3 Learning to play chess using TD(λ)-learning with database games]''. [http://students.uu.nl/en/hum/cognitive-artificial-intelligence Cognitive Artificial Intelligence], [https://en.wikipedia.org/wiki/Utrecht_University Utrecht University], Benelearn’04, [https://www.ai.rug.nl/~mwiering/GROUP/ARTICLES/learning-chess.pdf pdf]

* [https://dblp.org/pid/20/4400.html Jan Peter Patist], [[Marco Wiering]] ('''2004'''). ''Learning to Play Draughts using Temporal Difference Learning with Neural Networks and Databases''. [http://students.uu.nl/en/hum/cognitive-artificial-intelligence Cognitive Artificial Intelligence], [https://en.wikipedia.org/wiki/Utrecht_University Utrecht University], Benelearn’04

==2005 ...==

* [[Marco Wiering]], [https://dblp.org/pid/20/4400.html Jan Peter Patist], [[Henk Mannen]] ('''2005'''). ''[https://www.semanticscholar.org/paper/Learning-to-Play-Board-Games-using-Temporal-Methods-Wiering-Patist/7410e2bf16ed184db89f0e3acbbfdad473623b7a Learning to Play Board Games using Temporal Difference Methods]''. Technical Report, [https://en.wikipedia.org/wiki/Utrecht_University Utrecht University], UU-CS-2005-048, [http://webdoc.sub.gwdg.de/ebook/serien/ah/UU-CS/2005-048.pdf pdf]

* [[Marco Wiering]] ('''2005'''). ''QV (λ)-learning: A new on-policy reinforcement learning algorithm''. Proceedings of the 7th European Workshop on Reinforcement Learning, [http://www.ai.rug.nl/~mwiering/GROUP/ARTICLES/QV_learning_ewrl.pdf pdf]

==2010 ...==

GerdIsenberg

