Difference between revisions of "Marco Wiering"

From Chessprogramming wiki
Jump to: navigation, search
Line 14: Line 14:
 
* [[Marco Wiering]] ('''1999'''). ''Explorations in Efficient Reinforcement Learning''. Ph.D. thesis, [https://en.wikipedia.org/wiki/University_of_Amsterdam University of Amsterdam], advisors [[Mathematician#FGroen|Frans Groen]] and [[Jürgen Schmidhuber]]
 
* [[Marco Wiering]] ('''1999'''). ''Explorations in Efficient Reinforcement Learning''. Ph.D. thesis, [https://en.wikipedia.org/wiki/University_of_Amsterdam University of Amsterdam], advisors [[Mathematician#FGroen|Frans Groen]] and [[Jürgen Schmidhuber]]
 
==2000 ...==
 
==2000 ...==
* [[Marco Wiering]] ('''2000'''). ''Multi-agent reinforcement learning for traffic light control''. ICML, [http://www.dcsc.tudelft.nl/~sc4081/assign/pap/Reinforcement_Learning.pdf pdf]
+
* [[Henk Mannen]], [[Marco Wiering]] ('''2004'''). ''[https://www.semanticscholar.org/paper/Learning-to-Play-Chess-using-TD(lambda)-learning-Mannen-Wiering/00a6f81c8ebe8408c147841f26ed27eb13fb07f3 Learning to play chess using TD(λ)-learning with database games]''. Cognitive Artificial Intelligence, [https://en.wikipedia.org/wiki/Utrecht_University Utrecht University], Benelearn’04, [https://www.ai.rug.nl/~mwiering/GROUP/ARTICLES/learning-chess.pdf pdf]
* [[Henk Mannen]], [[Marco Wiering]] ('''2004'''). ''Learning to play chess using TD(λ)-learning with database games''. [http://students.uu.nl/en/hum/cognitive-artificial-intelligence Cognitive Artificial Intelligence], [https://en.wikipedia.org/wiki/Utrecht_University Utrecht University], Benelearn’04
+
* [https://dblp.org/pid/20/4400.html Jan Peter Patist], [[Marco Wiering]] ('''2004'''). ''Learning to Play Draughts using Temporal Difference Learning with Neural Networks and Databases''. [http://students.uu.nl/en/hum/cognitive-artificial-intelligence Cognitive Artificial Intelligence], [https://en.wikipedia.org/wiki/Utrecht_University Utrecht University], Benelearn’04
* [http://dblp.uni-trier.de/pers/hd/p/Patist:Jan_Peter Jan Peter Patist], [[Marco Wiering]] ('''2004'''). ''Learning to Play Draughts using Temporal Difference Learning with Neural Networks and Databases''. [http://students.uu.nl/en/hum/cognitive-artificial-intelligence Cognitive Artificial Intelligence], [https://en.wikipedia.org/wiki/Utrecht_University Utrecht University], Benelearn’04
 
 
==2005 ...==
 
==2005 ...==
* [[Marco Wiering]], [http://dblp.uni-trier.de/pers/hd/p/Patist:Jan_Peter Jan Peter Patist], [[Henk Mannen]] ('''2005'''). ''Learning to Play Board Games using Temporal Difference Methods''. Technical Report, [https://en.wikipedia.org/wiki/Utrecht_University Utrecht University], UU-CS-2005-048, [http://www.ai.rug.nl/~mwiering/GROUP/ARTICLES/learning_games_TR.pdf pdf]
+
* [[Marco Wiering]], [https://dblp.org/pid/20/4400.html Jan Peter Patist], [[Henk Mannen]] ('''2005'''). ''[https://www.semanticscholar.org/paper/Learning-to-Play-Board-Games-using-Temporal-Methods-Wiering-Patist/7410e2bf16ed184db89f0e3acbbfdad473623b7a Learning to Play Board Games using Temporal Difference Methods]''. Technical Report, [https://en.wikipedia.org/wiki/Utrecht_University Utrecht University], UU-CS-2005-048, [http://webdoc.sub.gwdg.de/ebook/serien/ah/UU-CS/2005-048.pdf pdf]
 
* [[Marco Wiering]] ('''2005'''). ''QV (λ)-learning: A new on-policy reinforcement learning algorithm''. Proceedings of the 7th European Workshop on Reinforcement Learning, [http://www.ai.rug.nl/~mwiering/GROUP/ARTICLES/QV_learning_ewrl.pdf pdf]
 
* [[Marco Wiering]] ('''2005'''). ''QV (λ)-learning: A new on-policy reinforcement learning algorithm''. Proceedings of the 7th European Workshop on Reinforcement Learning, [http://www.ai.rug.nl/~mwiering/GROUP/ARTICLES/QV_learning_ewrl.pdf pdf]
 
==2010 ...==
 
==2010 ...==

Revision as of 09:49, 15 April 2021

Home * People * Marco Wiering

Marco Wiering [1]

Marco Alexander Wiering,
a Dutch mathematician, computer scientist, and assistant professor at Faculty of Mathematics and Natural Sciences, artificial intelligence and cognitive engineering, University of Groningen with tenure track for associate professor, and until September 2007 assistant professor at Utrecht University. He holds a Ph.D. on the topic of reinforcement learning from University of Amsterdam in 1999, thesis advisors were Frans Groen and Jürgen Schmidhuber. His research interests include artificial intelligence, machine learning, neural networks, object recognition, pattern recognition, evolutionary computation, robotics, game playing, multi-agent systems, time-series analysis and computer vision [2].

Selected Publications

[3] [4]

1995 ...

2000 ...

2005 ...

2010 ...

2015 ...

External Links

References

Up one level