Learning

* [[Sacha Droste]], [[Johannes Fürnkranz]] ('''2008'''). ''Learning of Piece Values for Chess Variants.'' Technical Report TUD-KE-2008-07, Knowledge Engineering Group, [[Darmstadt University of Technology|TU Darmstadt]], [http://www.ke.tu-darmstadt.de/publications/reports/tud-ke-2008-07.pdf pdf]
* [[Sacha Droste]], [[Johannes Fürnkranz]] ('''2008'''). ''Learning the Piece Values for three Chess Variants''. [[ICGA Journal#31_4|ICGA Journal, Vol 31, No. 4]]
* [[Richard Sutton]], [[Csaba Szepesvári]], [[Hamid Reza Maei]] ('''2008'''). ''A Convergent O(n) Algorithm for Off-policy Temporal-difference Learning with Linear Function Approximation''. [https://dblp.uni-trier.de/db/conf/nips/nips2008.html#SuttonSM08 NIPS 2008], [https://proceedings.neurips.cc/paper/2008/file/e0c641195b27425bb056ac56f8953d24-Paper.pdf pdf]
* [[Matej Guid]], [[Martin Možina]], [[Jana Krivec]], [[Aleksander Sadikov]], [[Ivan Bratko]] ('''2008'''). ''[http://link.springer.com/chapter/10.1007/978-3-540-87608-3_18 Learning Positional Features for Annotating Chess Games: A Case Study]''. [[CG 2008]], [http://www.ailab.si/matej/doc/Learning_Positional_Features-Case_Study.pdf pdf]
* [[Martin Možina]], [[Matej Guid]], [[Jana Krivec]], [[Aleksander Sadikov]], [[Ivan Bratko]] ('''2008'''). ''Fighting Knowledge Acquisition Bottleneck with Argument Based Machine Learning''. 18th European Conference on Artificial Intelligence (ECAI 2008), Patras, Greece. [http://www.ailab.si/martin/abml/abml_expert_system_for_web.pdf pdf]
* [[Eli David|Omid David]] ('''2009'''). ''Genetic Algorithms Based Learning for Evolving Intelligent Organisms''. Ph.D. Thesis <ref>[[Dap Hartmann]] ('''2010'''). ''Mimicking the Black Box - Genetically evolving evaluation functions and search algorithms''. Review on Omid David's Ph.D. Thesis, [[ICGA Journal#33_1|ICGA Journal, Vol 33, No. 1]]</ref>
* [[Nur Merve Amil]], [[Nicolas Bredèche]], [[Christian Gagné]], [[Sylvain Gelly]], [[Marc Schoenauer]], [[Olivier Teytaud]] ('''2009'''). ''A Statistical Learning Perspective of Genetic Programming''. EuroGP 2009, [http://hal.inria.fr/docs/00/36/97/82/PDF/eurogp.pdf pdf]
* [[Richard Sutton]], [[Hamid Reza Maei]], [[Doina Precup]], [[Shalabh Bhatnagar]], [[David Silver]], [[Csaba Szepesvári]], [[Eric Wiewiora]] ('''2009'''). ''Fast Gradient-Descent Methods for Temporal-Difference Learning with Linear Function Approximation''. In Proceedings of the 26th International Conference on Machine Learning (ICML-09), [http://www.sztaki.hu/~szcsaba/papers/GTD-ICML09.pdf pdf]
* [[Hamid Reza Maei]], [[Csaba Szepesvári]], [[Shalabh Bhatnagar]], [[Doina Precup]], [[David Silver]], [[Richard Sutton]] ('''2009'''). ''Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation''. [https://dblp.uni-trier.de/db/conf/nips/nips2009.html#MaeiSBPSS09 NIPS 2009], [https://papers.nips.cc/paper/2009/file/3a15c7d0bbe60300a39f76f8a5ba6896-Paper.pdf pdf]
* [[Mesut Kirci]], [[Jonathan Schaeffer]], [[Nathan Sturtevant]] ('''2009'''). ''Feature Learning Using State Differences''. [http://web.cs.du.edu/~sturtevant/papers/GGPfeatures.pdf pdf]
* [[David Silver]], [[Gerald Tesauro]] ('''2009'''). ''Monte-Carlo Simulation Balancing''. [http://www.informatik.uni-trier.de/~ley/db/conf/icml/icml2009.html#SilverT09 ICML 2009], [http://www.machinelearning.org/archive/icml2009/papers/500.pdf pdf] <ref>[http://videolectures.net/icml09_silver_mcsb/ Monte-Carlo Simulation Balancing - videolectures.net] by [[David Silver]]</ref>
* [[Julien Pérez]], [[Cécile Germain-Renaud]], [[Balázs Kégl]], [[Charles Loomis]] ('''2010'''). ''Multi-objective Reinforcement Learning for Responsive Grids''. In The Journal of Grid Computing. [http://hal.archives-ouvertes.fr/docs/00/49/15/60/PDF/RLGrid_JGC09_V7.pdf pdf]
* [[Jean-Yves Audibert]] ('''2010'''). ''PAC-Bayesian aggregation and multi-armed bandits''. Habilitation thesis, [http://fr.wikipedia.org/wiki/Universit%C3%A9_Paris-Est Université Paris Est], [http://certis.enpc.fr/~audibert/Mes%20articles/hdr.pdf pdf], [http://certis.enpc.fr/~audibert/Mes%20articles/hdrSlides.pdf slides as pdf]
* [[Hamid Reza Maei]], [[Richard Sutton]] ('''2010'''). ''[https://www.researchgate.net/publication/215990384_GQlambda_A_general_gradient_algorithm_for_temporal-difference_prediction_learning_with_eligibility_traces GQ(λ): A general gradient algorithm for temporal-difference prediction learning with eligibility traces]''. [https://agi-conf.org/2010/ AGI 2010]
* [http://www.informatik.uni-trier.de/~ley/db/indices/a-tree/w/Waledzik:Karol.html Karol Walędzik], [[Jacek Mańdziuk]] ('''2010'''). ''The Layered Learning method and its Application to Generation of Evaluation Functions for the Game of Checkers''. [http://www.informatik.uni-trier.de/~ley/db/conf/ppsn/ppsn2010-2.html#WaledzikM10 11. PPSN], [http://www.mini.pw.edu.pl/~mandziuk/PRACE/PPSN10.pdf pdf] » [[Checkers]]
* [[Krzysztof Krawiec]], [[Marcin Szubert]] ('''2010'''). ''[http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=5586054 Coevolutionary Temporal Difference Learning for small-board Go]''. [[IEEE#EC|IEEE Congress on Evolutionary Computation]] » [[Go]]
* [[Krzysztof Krawiec]], [[Wojciech Jaśkowski]], [[Marcin Szubert]] ('''2011'''). ''[http://www.degruyter.com/view/j/amcs.2011.21.issue-4/v10006-011-0057-3/v10006-011-0057-3.xml Evolving small-board Go players using Coevolutionary Temporal Difference Learning with Archives]''. [http://www.degruyter.com/view/j/amcs Applied Mathematics and Computer Science], Vol. 21, No. 4
* [[Marcin Szubert]], [[Wojciech Jaśkowski]], [[Krzysztof Krawiec]] ('''2011'''). ''Learning Board Evaluation Function for Othello by Hybridizing Coevolution with Temporal Difference Learning''. [http://control.ibspan.waw.pl:3000/mainpage Control and Cybernetics], Vol. 40, No. 3, [http://www.cs.put.poznan.pl/wjaskowski/pub/papers/szubert2011learning.pdf pdf] » [[Othello]]
* [[Hamid Reza Maei]] ('''2011'''). ''[https://era.library.ualberta.ca/items/fd55edcb-ce47-4f84-84e2-be281d27b16a Gradient Temporal-Difference Learning Algorithms]''. Ph.D. thesis, [[University of Alberta]], advisor [[Richard Sutton]], [http://webdocs.cs.ualberta.ca/~sutton/papers/maei-thesis-2011.pdf pdf]
* [[Eli David|Omid David]], [[Moshe Koppel]], [[Nathan S. Netanyahu]] ('''2011'''). ''[https://link.springer.com/article/10.1007/s10710-010-9103-4 Expert-Driven Genetic Algorithms for Simulating Evaluation Functions]''. [https://www.springer.com/journal/10710 Genetic Programming and Evolvable Machines], Vol. 12, No. 1, [https://arxiv.org/abs/1711.06841 arXiv:1711.06841]
'''2012'''
