Changes

Learning

2,802 bytes added, 09:58, 5 July 2021

no edit summary

* [[Ross Quinlan]] ('''1979'''). ''Discovering Rules by Induction from Large Collections of Examples''. Expert Systems in the Micro-electronic Age, pp. 168-201. Edinburgh University Press (Introducing ID3)

==1980 ...==

* [[Sarah E. Goldin]], [http://www.linkedin.com/pub/phil-klahr/8/72b/676/de Philip Klahr] ('''1981'''). ''[http://dl.acm.org/citation.cfm?id=1623197 Learning and Abstraction in Simulation]''. [~~http://www.informatik.uni-trier.de/~ley/db/conf/ijcai/ijcai81.html~~[Conferences#~~GoldinK81~~ IJCAI1981|IJCAI 1981]], [http://ijcai.org/Past%20Proceedings/IJCAI-81-VOL%201/PDF/042.pdf pdf]

* [[Paul E. Utgoff]], [[Tom Mitchell]] ('''1982'''). ''Acquisition of Appropriate Bias for Inductive Concept Learning''. [http://dblp.uni-trier.de/db/conf/aaai/aaai82.html#UtgoffM82 AAAI 1982], [https://www.aaai.org/Papers/AAAI/1982/AAAI82-099.pdf pdf]

* [[A. Harry Klopf]] ('''1982'''). ''The Hedonistic Neuron: A Theory of Memory, Learning, and Intelligence''. Hemisphere Publishing Corporation, [[University of Michigan]]

==1985 ...==

* [[Tony Marsland]] ('''1985'''). ''Evaluation-Function Factors''. [[ICGA Journal#8_2|ICCA Journal, Vol. 8, No. 2]], [http://webdocs.cs.ualberta.ca/~tony/OldPapers/evaluation.pdf pdf]

* [[Albrecht Heeffer]] ('''1985'''). ''Validating Concepts from Automated Acquisition Systems''. [[Conferences#~~IJCAI~~IJCAI1985|IJCAI 851985]], [http://ijcai.org/Past%20Proceedings/IJCAI-85-VOL1/PDF/118.pdf pdf]

* [[Hans Berliner]] ('''1985'''). ''Goals, Plans, and Mechanisms: Non-symbolically in an Evaluation Surface.'' Presentation at Evolution, Games, and Learning, Center for Nonlinear Studies, [[Los Alamos National Laboratory]], May 21.

* [[Ryszard Michalski]], [[Jaime Carbonell]], [[Tom Mitchell]] ('''1985, 2014'''). ''[https://www.elsevier.com/books/machine-learning/michalski/978-0-08-051054-5?gclid=EAIaIQobChMItc_hsp_34AIVUeR3Ch2l9QcDEAYYASABEgKW4_D_BwEMachine Learning: An Artificial Intelligence Approach, Volume I]''. [https://en.wikipedia.org/wiki/Morgan_Kaufmann_Publishers Morgan Kaufmann]

* [[Gerald Tesauro]], [[Terrence J. Sejnowski]] ('''1987'''). ''A 'Neural' Network that Learns to Play Backgammon''. [http://www.informatik.uni-trier.de/~ley/db/conf/nips/nips1987.html#TesauroS87 NIPS 1987]

* [[Alen Shapiro]] ('''1987'''). ''Structured Induction in Expert Systems''. Turing Institute Press in association with Addison-Wesley Publishing Company, Workingham, UK

* [[Alberto Maria Segre]] ('''1987'''). ''On the Operationality/Generality Trade-off in Explanation-based Learning''. [~~http://dblp.uni-trier.de/db/conf/ijcai/ijcai87.html~~ [Conferences#IJCAI1987|IJCAI 1987]], [http://ijcai.org/Past%20Proceedings/IJCAI-87-VOL1/PDF/049.pdf pdf]

* [[Alberto Maria Segre]] ('''1987'''). ''Explanation-Based Learning of Generalized Robot Assembly Plans''. Ph.D. thesis, [[University of Illinois at Urbana-Champaign]], Advisor: [http://www.ece.illinois.edu/directory/profile.asp?mrebl Gerald Francis DeJong, II]

* [[Eric B. Baum]], [https://en.wikipedia.org/wiki/Frank_Wilczek Frank Wilczek] ('''1987'''). ''[http://papers.nips.cc/paper/3-supervised-learning-of-probability-distributions-by-neural-networks Supervised Learning of Probability Distributions by Neural Networks]''. [http://papers.nips.cc/book/neural-information-processing-systems-1987 NIPS 1987]

* [[Kai-Fu Lee]], [[Sanjoy Mahajan]] ('''1988'''). ''[http://www.sciencedirect.com/science/article/pii/0004370288900768 A Pattern Classification Approach to Evaluation Function Learning]''. [https://en.wikipedia.org/wiki/Artificial_Intelligence_%28journal%29 Artificial Intelligence], Vol. 36, No. 1

* [[Paul E. Utgoff]] ('''1988'''). ''[http://dl.acm.org/citation.cfm?id=896712 ID5: An incremental ID3]''. [http://dblp.uni-trier.de/db/conf/icml/ml1988.html#Utgoff88 ML 1988]

* [[Shaul Markovitch]], [[Mathematician#PDScott|Paul D. Scott]] ('''1988'''). ''[https://www.semanticscholar.org/paper/The-Role-of-Forgetting-in-Learning-Markovitch-Scott/adbd75db1f85dd3545b4d6b8bba509bf20d7bfce The Role of Forgetting in Learning]''. [https://dblp.uni-trier.de/db/conf/icml/ml1988.html ML 1988], [http://www.cs.technion.ac.il/~shaulm/papers/pdf/Markovitch-Scott-icml1988.pdf pdf]

'''1989'''

* [[David E. Goldberg]] ('''1989'''). ''Genetic Algorithms in Search, Optimization and Machine Learning''. [https://en.wikipedia.org/wiki/Addison-Wesley Addison-Wesley]

* [[Gerald Tesauro]] ('''1992'''). ''[http://dl.acm.org/citation.cfm?id=139616 Practical Issues in Temporal Difference Learning]''. [https://en.wikipedia.org/wiki/Machine_Learning_(journal) Machine Learning], Vol. 8, No. 3-4

* [[Manuela Veloso]] ('''1992'''). ''[http://search.library.cmu.edu/vufind/Record/421096 Learning by Analogical Reasoning in General Purpose Problem Solving]''. Ph.D. thesis, [[Carnegie Mellon University]], advisor [[Jaime Carbonell]]

* [[Chris J. Thornton]] ('''1992'''). ''Techniques in Computational Learning: An Introduction''. [https://en.wikipedia.org/wiki/Chapman_%26_Hall Chapman & Hall]

'''1993'''

* [[Michael Gherrity]] ('''1993'''). ''A Game Learning Machine''. Ph.D. Thesis, [http://de.wikipedia.org/wiki/University_of_California,_San_Diego University of California, San Diego], [http://www.gherrity.org/thesis.ps.gz zipped ps]

* [[Shaul Markovitch]], [~~http://www.cs.huji.ac.il/labs/danss/Fairplay/~~ [Yaron Sella]] ('''1993'''). ''[https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1467-8640.1996.tb00254.x Learning of Resource Allocation Strategies for Game Playing]''. [[Conferences#IJCAI1993|IJCAI 1993]], ~~The proceedings of the 13th International Joint Conference on Artificial Intelligence, Chambery, France.~~ [~~http~~https://www.~~cs.technion~~ijcai.~~ac.il~~org/~~~shaulm~~Proceedings/~~papers~~93-2/~~pdf~~Papers/~~Markovitch-Sella-coin1996~~020.pdf pdf]* [[Mathematician#DCarmel|David Carmel]], [[Shaul Markovitch]] ('''1993'''). ''[https://aaai.org/Library/Symposia/Fall/1993/fs93-02-019.php Learning Models of Opponent's Strategy in Game Playing]''. [[Conferences#AAAI-93|AAAI1993]] ~~Proceedings~~, FS-93-02, [~~http~~https://~~citeseerx~~www.~~ist~~aaai.~~psu.edu~~org/Papers/Symposia/Fall/1993/~~viewdoc~~FS-93-02/~~summary?doi=10~~FS93-02-019.~~1.1.55.6488 CiteSeerX~~pdf pdf]* [[Mathematician#DGeiger|Dan Geiger]], [[Mathematician#APaz|Azaria Paz]], [[Judea Pearl]] ('''1993'''). ''Learning simple causal structures''. [http://onlinelibrary.wiley.com/journal/10.1002/%28ISSN%291098-111X International Journal of Intelligent Systems], Vol. 8~~, pp. 231-247.~~

* [[Sebastian Thrun]], [[Tom Mitchell]] ('''1993'''). ''Integrating Inductive Neural Network Learning and Explanation-Based Learning''. [[Conferences#IJCAI1993|IJCAI 1993]], [http://robots.stanford.edu/papers/thrun.EBNN_ijcai93.ps.gz zipped ps]

* [[Alois Heinz]], [[Christoph Hense]] ('''1993'''). ''[http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.56.872 Bootstrap learning of α-β-evaluation functions]''. [http://dblp.uni-trier.de/db/conf/icci/icci1993.html#HeinzH93 ICCI 1993], [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.56.872&rep=rep1&type=pdf pdf]

* [[Gerhard Mehlsam]], [[Hermann Kaindl]], [[Wilhelm Barth]] ('''1995'''). ''Feature Construction during Tree Learning''. [http://137.226.34.227/dblp/db/conf/gosler/gosler1995.html GOSLER Final Report] 1995: 391-403

* [[Chris McConnell]] ('''1995'''). ''Tuning Evaluation Functions for Search''. [http://www.cs.cmu.edu/afs/cs.cmu.edu/user/ccm/www/papers/ml.ps ps] or [http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=9B2A0CCA8B1AFB594A879799D974111A?doi=10.1.1.53.9742&rep=rep1&type=pdf pdf] from [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.53.9742 CiteSeerX]

* [[David Heckerman]], [[Mathematician#DGeiger|Dan Geiger]], [[Max Chickering]] ('''1995'''). ''Learning Bayesian Networks: The Combination of Knowledge and Statistical Data''. [https://en.wikipedia.org/wiki/Machine_Learning_%28journal%29 Machine Learning], Vol. 20, [http://research.microsoft.com/en-us/um/people/dmax/publications/ml95.pdf pdf]

* [[Tristan Cazenave]] ('''1995'''). ''Learning and Problem Solving in Gogol, a Go playing program''. [http://www.lamsade.dauphine.fr/~cazenave/papers/cazenave95learning.pdf pdf]

* [[Gerald Tesauro]] ('''1995'''). ''Temporal Difference Learning and TD-Gammon''. [[ACM#Communications|Communications of the ACM]] Vol. 38, No. 3

* [[Don Beal]], [[Martin C. Smith]] ('''1997'''). ''Learning Piece Values Using Temporal Differences''. [[ICGA Journal#20_3|ICCA Journal, Vol. 20, No. 3]]

* [[Kieran Greer]], [[Piyush Ojha]], [[David A. Bell]] ('''1997'''). ''Learning Search Heuristics from Examples: A Study in Computer Chess'', Seventh Conference of the Spanish Association for Artificial Intelligence, CAEPIA’97, November, pp. 695-704.

* [[Nir Friedman]], [[Moises Goldszmidt]], [[David Heckerman]], [[Stuart Russell]] ('''1997'''). ''Where is the Impact of Bayesian Networks in Learning?'' ~~In Proc. Fifteenth International Joint Conference on Artificial Intelligence, Nagoya, Japan~~[[Conferences#IJCAI1997|IJCAI 1997]], [http://www.cs.berkeley.edu/~russell/papers/ijcai97-challenge.ps ps]

* [[Ronald Parr]], [[Stuart Russell]] ('''1997'''). ''Reinforcement Learning with Hierarchies of Machines.'' In Advances in Neural Information Processing Systems 10, MIT Press, [http://www.cs.berkeley.edu/~russell/papers/nips97-ham.ps.gz zipped ps]

* [[Tristan Cazenave]] ('''1997'''). ''Gogol (an Analytical Learning Program)''. [~~http://www.ijcai.org/past/ijcai-97/~~ [Conferences#IJCAI1997|IJCAI~~'97~~1997]], [http://www.lamsade.dauphine.fr/~cazenave/papers/fost97.pdf pdf]

* [[Tom Mitchell]] ('''1997'''). ''[http://www.cs.cmu.edu/%7Etom/mlbook.html Machine Learning]''. [https://en.wikipedia.org/wiki/McGraw-Hill McGraw Hill]

* [[Michèle Sebag]] ('''1997'''). ''Stochastic Heuristics for Machine Learning & Machine Learning for Stochastic Optimization''. Habilitation, [https://en.wikipedia.org/wiki/Paris-Sud_11_University Paris-Sud 11 University]

* [[David Heckerman]] ('''1999'''). ''A tutorial on learning with Bayesian networks''. [http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=F04616607A620324B33D40A8ABB702CB?doi=10.1.1.15.4522&rep=rep1&type=pdf pdf] from [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.15.4522 CiteSeerX]

* [http://www2.lifl.fr/%7Edecomite/ F. De Comité], [http://www.lif.univ-mrs.fr/%7Efdenis/ F. Denis], [http://www.grappa.univ-lille3.fr/%7Egilleron/ R. Gilleron] et [[Fabien Letouzey]] ('''1999'''). ''Positive and Unlabeled Examples help Learning'', The 10th International Conference on Algorithmic Learning Theory, [http://www.cmi.univ-mrs.fr/%7Efdenis/alt99.ps ps]

* [http://www.ilsp.gr/homepages/papavasiliou_eng.html Vassilis Papavassiliou], [[Stuart Russell]] ('''1999'''). ''Convergence of reinforcement learning with general function approximators.'' ~~In Proc.~~ [[Conferences#IJCAI1999|IJCAI~~-99, Stockholm~~1999]], [http://www.cs.berkeley.edu/~russell/papers/ijcai99-bridge.ps ps]

* [[Philip G. K. Reiser]], [[Patricia J. Riddle]] ('''1999'''). ''[http://link.springer.com/chapter/10.1007%2F3-540-48873-1_19 Evolving Logic Programs to Classify Chess-Endgame Positions]''. [http://link.springer.com/book/10.1007%2F3-540-48873-1 Simulated Evolution and Learning], [https://en.wikipedia.org/wiki/Canberra Canberra], Australia. [http://www.springer.com/series/1244 Lecture Notes in Artificial Intelligence], No. 1585, [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer], [http://stancomb.co.uk/Papers/seal98.pdf pdf] » [[Endgame]]

* [[Marco Wiering]] ('''1999'''). ''[Explorations in Efficient Reinforcement Learning''. Ph.D. thesis, [https://en.wikipedia.org/wiki/University_of_Amsterdam University of Amsterdam], advisors [[Mathematician#FGroen|Frans Groen]] and [[Jürgen Schmidhuber]]

* [[Jonathan Baxter]], [[Andrew Tridgell]], [[Lex Weaver]] ('''2000'''). ''Learning to Play Chess Using Temporal Differences''. [http://www.dblp.org/db/journals/ml/ml40.html#BaxterTW00 Machine Learning, Vol 40, No. 3], [http://www.cs.princeton.edu/courses/archive/fall06/cos402/papers/chess-RL.pdf pdf]

* [[Michael Bain]], [[Stephen Muggleton]], [[Ashwin Srinivasan]] ('''2000'''). ''Generalising Closed World Specialisation: A Chess End Game Application''. [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.24.3499 CitySeerX]

* [[Chris J. Thornton]] ('''2000'''). ''[https://www.goodreads.com/book/show/1097454.Truth_from_Trash Truth from Trash: How Learning Makes Sense]''. [https://en.wikipedia.org/wiki/MIT_Press Bradford Books] <ref>[[Jean Hayes Michie]] ('''2001'''). ''[https://www.aaai.org/ojs/index.php/aimagazine/article/view/1599/0 Machine Learning and Light Relief: A Review of Truth from Trash]''. [[AAAI#AIMAG|AI Magazine]], Vol. 22 No. 4, [http://www.aaai.org/ojs/index.php/aimagazine/article/download/1599/1498 pdf]</ref>

'''2001'''

* [[Nicol N. Schraudolph]], [[Peter Dayan]], [[Terrence J. Sejnowski]] ('''2001'''). ''[http://nic.schraudolph.org/bib2html/b2hd-SchDaySej01.html Learning to Evaluate Go Positions via Temporal Difference Methods]''. [http://jasss.soc.surrey.ac.uk/7/1/reviews/takama.html Computational Intelligence in Games, Studies in Fuzziness and Soft Computing] [http://www.springer.com/economics?SGWID=1-165-6-73481-0 Physica-Verlag], revised version of [[Nicol N. Schraudolph#1993|1993 paper]]

* [[Jonathan Schaeffer]], [[Markian Hlynka]], [[Vili Jussila]] ('''2001'''). ''Temporal Difference Learning Applied to a High-Performance Game-Playing Program''. [~~http://www.informatik.uni-trier.de/~ley/db/conf/ijcai/ijcai2001.html~~[Conferences#~~SchaefferHJ01~~ IJCAI2001|IJCAI 2001]]* [[Michael Bowling]], [[Manuela Veloso|Manuela M. Veloso]] ('''2001'''). ''Rational and Convergent Learning in Stochastic Games''. [~~http://www.informatik.uni-trier.de/~ley/db/conf/ijcai/ijcai2001.html~~[Conferences#~~BowlingV01~~ IJCAI2001|IJCAI 2001]]* [[Levente Kocsis]], [[Jos Uiterwijk]], [[Jaap van den Herik]] ('''2001'''). ''Move Ordering using Neural Networks'', IEA/AIE 2001, LNCS 2070~~, 45-50~~ [http://zaphod.aml.sztaki.hu/papers/kocsis-IEA01.ps ps]

* [[Marty Hirsch]] ('''2001'''). ''Machine Learning in MChess Professional''. [[Advances in Computer Games 9]]

* [[Yngvi Björnsson]], [[Tony Marsland]] ('''2001'''). ''Learning Search Control in Adversary Games''. [[Advances in Computer Games 9]], pp. 157-174. [http://www.ru.is/faculty/yngvi/pdf/BjornssonM01b.pdf pdf]

* [[Jean Hayes Michie]] ('''2001'''). ''[http://www.aaai.org/ojs/index.php/aimagazine/article/view/1599/0 Machine Learning and Light Relief: A Review of Truth from Trash]''. [http://www.informatik.uni-trier.de/~ley/db/journals/aim/aim22.html#Michie01 AI Magazine Vol. 22 No. 4], [http://www.aaai.org/ojs/index.php/aimagazine/article/download/1599/1498 pdf]

* [[Pieter Spronck]], [[Ida Sprinkhuizen-Kuyper]], [[Eric Postma]] ('''2001'''). ''Infused Evolutionary Learning''. Proceedings of the Eleventh Belgian-Dutch Conference on Machine Learning, [http://www.cnts.ua.ac.be/benelearn2001/proceedings/bene01-spronck.pdf pdf], [http://ticc.uvt.nl/~pspronck/pubs/InfusedEvolutionaryLearning.pdf pdf]

* [[Charles Elkan]] ('''2001'''). ''The Foundations of Cost-Sensitive Learning''. [[Conferences#~~IJCAI~~IJCAI2001|IJCAI 2001]]

* [[Alex B. Meijer]], [[Henk Koppelaar]] ('''2001'''). ''[http://www.kbs.twi.tudelft.nl/Publications/Conference/2001/2001-MeijerKoppelaar-GAMEON01.html A learning architecture for the game of Go]''. [https://www.informs.org/Attend-a-Conference/Conference-Calendar/Game-On-2001 Game-On 2001]

* [[Johannes Fürnkranz]], [[Miroslav Kubat]] ('''2001'''). ''[https://www.novapublishers.com/catalog/product_info.php?products_id=720 Machines that Learn to Play Games]''. Advances in Computation: Theory and Practice, Vol. 8,. [https://en.wikipedia.org/wiki/Nova_Publishers NOVA Science Publishers]

* [[Pedro Campos]], [[Thibault Langlois]] ('''2003'''). ''[http://ilk.uvt.nl/icga/journal/contents/content26-4.htm#ABALEARN Abalearn: a Program that Learns How to Play Abalone]''. [[ICGA Journal#26_4|ICGA Journal, Vol. 26, No. 4]]

* [[David Gleich]] ('''2003'''). ''Machine Learning in Computer Chess: Genetic Programming and KRK''. [https://en.wikipedia.org/wiki/Harvey_Mudd_College Harvey Mudd College], [http://www.cs.purdue.edu/homes/dgleich/publications/Gleich%202003%20-%20Machine%20Learning%20in%20Computer%20Chess.pdf pdf]

* [[Henk Mannen]] ('''2003'''). ''Learning to play chess using reinforcement learning with database games''. Master’s thesis, ~~[http://students.uu.nl/en/hum/cognitive-artificial-intelligence~~ Cognitive Artiﬁcial Intelligence], [https://en.wikipedia.org/wiki/Utrecht_University Utrecht University], [https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.109.810&rep=rep1&type=pdf pdf]

* [[Jan Žižka]], [[Michal Mádr]] ('''2003'''). ''[https://www.muni.cz/research/publications/490371 Learning Representative Patterns from Real Chess Positions: A Case Study]''. [http://dblp.uni-trier.de/db/conf/iicai/iicai2003.html#ZizkaM03 IICAI 2003]

'''2004'''

* [[Daniel Osman]], [[Jacek Mańdziuk]] ('''2004'''). ''Comparison of TDLeaf and TD learning in Game Playing Domain''. [http://www.informatik.uni-trier.de/~ley/db/conf/iconip/iconip2004.html#OsmanM04 11. ICONIP], [http://www.mini.pw.edu.pl/~mandziuk/PRACE/ICONIP04.pdf pdf]

* [[Albert Xin Jiang]] ('''2004'''). ''Multiagent Reinforcement Learning in Stochastic Games with Continuous Action Spaces''. [http://www.cs.ubc.ca/%7Ejiang/papers/continuous.pdf pdf]

* [[Henk Mannen]], [[Marco Wiering]] ('''2004'''). ''[https://www.semanticscholar.org/paper/Learning-to-Play-Chess-using-TD(lambda)-learning-Mannen-Wiering/00a6f81c8ebe8408c147841f26ed27eb13fb07f3 Learning to play chess using TD(λ)-learning with database games]''. ~~[http://students.uu.nl/en/hum/cognitive-artificial-intelligence~~ Cognitive Artiﬁcial Intelligence], [https://en.wikipedia.org/wiki/Utrecht_University Utrecht University], Benelearn’04, [https://www.ai.rug.nl/~mwiering/GROUP/ARTICLES/learning-chess.pdf pdf]

==2005 ...==

* [[Dave Gomboc]], [[Michael Buro]], [[Tony Marsland]] ('''2005'''). ''Tuning evaluation functions by maximizing concordance'' Theoretical Computer Science, Volume 349, Issue 2, pp. 202-229, [http://www.cs.ualberta.ca/%7Emburo/ps/tcs-learn.pdf pdf]

* [[Sacha Droste]], [[Johannes Fürnkranz]] ('''2008'''). ''Learning of Piece Values for Chess Variants.'' Technical Report TUD–KE–2008-07, Knowledge Engineering Group, [[Darmstadt University of Technology|TU Darmstadt]], [http://www.ke.tu-darmstadt.de/publications/reports/tud-ke-2008-07.pdf pdf]

* [[Sacha Droste]], [[Johannes Fürnkranz]] ('''2008'''). ''Learning the Piece Values for three Chess Variants''. [[ICGA Journal#31_4|ICGA Journal, Vol 31, No. 4]]

* [[Richard Sutton]], [[Csaba Szepesvári]], [[Hamid Reza Maei]] ('''2008'''). ''A Convergent O(n) Algorithm for Off-policy Temporal-difference Learning with Linear Function Approximation''. [https://dblp.uni-trier.de/db/conf/nips/nips2008.html#SuttonSM08 NIPS 2008], [~~http~~https://~~www~~proceedings.~~sztaki~~neurips.hucc/paper/~~%7Eszcsaba~~2008/~~papers~~file/~~gtdnips08~~e0c641195b27425bb056ac56f8953d24-Paper.pdf pdf] ~~(draft)~~

* [[Matej Guid]], [[Martin Možina]], [[Jana Krivec]], [[Aleksander Sadikov]], [[Ivan Bratko]] ('''2008'''). ''[http://link.springer.com/chapter/10.1007/978-3-540-87608-3_18 Learning Positional Features for Annotating Chess Games: A Case Study]''. [[CG 2008]], [http://www.ailab.si/matej/doc/Learning_Positional_Features-Case_Study.pdf pdf]

* [[Martin Možina]], [[Matej Guid]], [[Jana Krivec]], [[Aleksander Sadikov]], [[Ivan Bratko]] ('''2008'''). ''Fighting Knowledge Acquisition Bottleneck with Argument Based Machine Learning''. 18th European Conference on Artificial Intelligence (ECAI 2008), Patras, Greece. [http://www.ailab.si/martin/abml/abml_expert_system_for_web.pdf pdf]

* [[Eli David|Omid David]] ('''2009'''). ''Genetic Algorithms Based Learning for Evolving Intelligent Organisms''. Ph.D. Thesis <ref>[[Dap Hartmann]] ('''2010'''). ''Mimicking the Black Box - Genetically evolving evaluation functions and search algorithms''. Review on Omid David's Ph.D. Thesis, [[ICGA Journal#33_1|ICGA Journal, Vol 33, No. 1]]</ref>

* [[Nur Merve Amil]], [[Nicolas Bredèche]], [[Christian Gagné]], [[Sylvain Gelly]], [[Marc Schoenauer]], [[Olivier Teytaud]] ('''2009'''). ''A Statistical Learning Perspective of Genetic Programming''. EuroGP 2009, [http://hal.inria.fr/docs/00/36/97/82/PDF/eurogp.pdf pdf]

* ~~[[Richard Sutton]],~~ [[Hamid Reza Maei]], [[~~Doina Precup~~Csaba Szepesvári]], [[Shalabh Bhatnagar]], [[~~David Silver~~Doina Precup]], [[~~Csaba Szepesvári~~David Silver]], [[~~Eric Wiewiora~~Richard Sutton]]. ('''2009'''). ''~~Fast Gradient-Descent Methods for~~ Convergent Temporal-Difference Learning with ~~Linear~~ Arbitrary Smooth Function Approximation''. ~~In Proceedings of the 26th International Conference on Machine Learning (ICML~~[https://dblp.uni-~~09)~~trier.de/db/conf/nips/nips2009. html#MaeiSBPSS09 NIPS 2009], [~~http~~https://~~www~~papers.~~sztaki~~nips.hucc/~~~szcsaba~~paper/~~papers~~2009/file/~~GTD~~3a15c7d0bbe60300a39f76f8a5ba6896-~~ICML09~~Paper.pdf pdf]

* [[Mesut Kirci]], [[Jonathan Schaeffer]], [[Nathan Sturtevant]] ('''2009'''). ''Feature Learning Using State Differences''. [http://web.cs.du.edu/~sturtevant/papers/GGPfeatures.pdf pdf]

* [[David Silver]], [[Gerald Tesauro]] ('''2009'''). ''Monte-Carlo Simulation Balancing''. [http://www.informatik.uni-trier.de/~ley/db/conf/icml/icml2009.html#SilverT09 ICML 2009], [http://www.machinelearning.org/archive/icml2009/papers/500.pdf pdf] <ref>[http://videolectures.net/icml09_silver_mcsb/ Monte-Carlo Simulation Balancing - videolectures.net] by [[David Silver]]</ref>

* [[Julien Pérez]], [[Cécile Germain-Renaud]], [[Balázs Kégl]], [[Charles Loomis]] ('''2010'''). ''Multi-objective Reinforcement Learning for Responsive Grids''. In The Journal of Grid Computing. [http://hal.archives-ouvertes.fr/docs/00/49/15/60/PDF/RLGrid_JGC09_V7.pdf pdf]

* [[Jean-Yves Audibert]] ('''2010'''). ''PAC-Bayesian aggregation and multi-armed bandits''. Habilitation thesis, [http://fr.wikipedia.org/wiki/Universit%C3%A9_Paris-Est Université Paris Est], [http://certis.enpc.fr/~audibert/Mes%20articles/hdr.pdf pdf], [http://certis.enpc.fr/~audibert/Mes%20articles/hdrSlides.pdf slides as pdf]

* [[Hamid Reza Maei]], [[Richard Sutton]] ('''2010'''). ''[~~http~~https://www.~~incompleteideas~~researchgate.net/~~sutton~~publication/~~publications.html#GQ~~ 215990384_GQlambda_A_general_gradient_algorithm_for_temporal-difference_prediction_learning_with_eligibility_traces GQ(λ): A general gradient algorithm for temporal-difference prediction learning with eligibility traces]''. ~~In Proceedings of the Third Conference on Artificial General Intelligence~~[https://agi-conf.org/2010/ AGI 2010]

* [http://www.informatik.uni-trier.de/~ley/db/indices/a-tree/w/Waledzik:Karol.html Karol Walędzik], [[Jacek Mańdziuk]] ('''2010'''). ''The Layered Learning method and its Application to Generation of Evaluation Functions for the Game of Checkers''. [http://www.informatik.uni-trier.de/~ley/db/conf/ppsn/ppsn2010-2.html#WaledzikM10 11. PPSN], [http://www.mini.pw.edu.pl/~mandziuk/PRACE/PPSN10.pdf pdf] » [[Checkers]]

* [[Krzysztof Krawiec]], [[Marcin Szubert]] ('''2010'''). ''[http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=5586054 Coevolutionary Temporal Difference Learning for small-board Go]''. [[IEEE#EC|IEEE Congress on Evolutionary Computation]] » [[Go]]

* [[Krzysztof Krawiec]], [[Wojciech Jaśkowski]], [[Marcin Szubert]] ('''2011'''). ''[http://www.degruyter.com/view/j/amcs.2011.21.issue-4/v10006-011-0057-3/v10006-011-0057-3.xml Evolving small-board Go players using Coevolutionary Temporal Difference Learning with Archives]''. [http://www.degruyter.com/view/j/amcs Applied Mathematics and Computer Science], Vol. 21, No. 4

* [[Marcin Szubert]], [[Wojciech Jaśkowski]], [[Krzysztof Krawiec]] ('''2011'''). ''Learning Board Evaluation Function for Othello by Hybridizing Coevolution with Temporal Difference Learning''. [http://control.ibspan.waw.pl:3000/mainpage Control and Cybernetics], Vol. 40, No. 3, [http://www.cs.put.poznan.pl/wjaskowski/pub/papers/szubert2011learning.pdf pdf] » [[Othello]]

* [[Hamid Reza Maei]] ('''2011'''). ''[https://era.library.ualberta.ca/items/fd55edcb-ce47-4f84-84e2-be281d27b16a Gradient Temporal-Difference Learning Algorithms]''. Ph.D. thesis, [[University of Alberta]], advisor [[Richard Sutton]~~], [http://webdocs.cs.ualberta.ca/~sutton/papers/maei-thesis-2011.pdf pdf~~]

* [[Eli David|Omid David]], [[Moshe Koppel]], [[Nathan S. Netanyahu]] ('''2011'''). ''[https://link.springer.com/article/10.1007/s10710-010-9103-4 Expert-Driven Genetic Algorithms for Simulating Evaluation Functions]''. [https://www.springer.com/journal/10710 Genetic Programming and Evolvable Machines], Vol. 12, No. 1, [https://arxiv.org/abs/1711.06841 arXiv:1711.06841]

'''2012'''

* [[Mathematician#ROrtner|Ronald Ortner]], [[Mathematician#DRyabko|Daniil Ryabko]], [[Peter Auer]], [[Rémi Munos]] ('''2014'''). ''Regret bounds for restless Markov bandits''. [https://en.wikipedia.org/wiki/Theoretical_Computer_Science_%28journal%29 Theoretical Computer Science] 558, [http://daniil.ryabko.net/mabajr.pdf pdf]

==2015 ...==

* [[Volodymyr Mnih]], [[Koray Kavukcuoglu]], [[David Silver]], [[Mathematician#AARusu|Andrei A. Rusu]], [[Joel Veness]], [[Marc G. Bellemare]], [[Alex Graves]], [[Martin Riedmiller]], [[Andreas K. Fidjeland]], [[Georg Ostrovski]], [[Stig Petersen]], [[Charles Beattie]], [[Amir Sadik]], [[Ioannis Antonoglou]], [[Helen King]], [[Dharshan Kumaran]], [[Daan Wierstra]], [[Shane Legg]], [[Demis Hassabis]] ('''2015'''). ''[http://www.nature.com/nature/journal/v518/n7540/abs/nature14236.html Human-level control through deep reinforcement learning]''. [https://en.wikipedia.org/wiki/Nature_%28journal%29 Nature], Vol. 518

* [[Tobias Graf]], [[Marco Platzner]] ('''2015'''). ''Adaptive Playouts in Monte Carlo Tree Search with Policy Gradient Reinforcement Learning''. [[Advances in Computer Games 14]]

* [[Yuichiro Sato]], [[Hiroyuki Iida]], [[Jaap van den Herik]] ('''2015'''). ''Transfer Learning by Inductive Logic Programming''. [[Advances in Computer Games 14]]

* [[Jialin Liu]], [[Olivier Teytaud]], [[Tristan Cazenave]] ('''2016'''). ''Fast seed-learning algorithms for games''. [[CG 2016]]

* [[Eli David|Omid E. David]], [[Nathan S. Netanyahu]], [[Lior Wolf]] ('''2016'''). ''[http://link.springer.com/chapter/10.1007%2F978-3-319-44781-0_11 DeepChess: End-to-End Deep Neural Network for Automatic Learning in Chess]''. [http://icann2016.org/ ICAAN 2016], [https://en.wikipedia.org/wiki/Lecture_Notes_in_Computer_Science Lecture Notes in Computer Science], Vol. 9887, [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer], [http://www.cs.tau.ac.il/~wolf/papers/deepchess.pdf pdf preprint] » [[DeepChess]] <ref>[http://www.talkchess.com/forum/viewtopic.php?t=61748 DeepChess: Another deep-learning based chess program] by [[Matthew Lai]], [[CCC]], October 17, 2016</ref> <ref>[http://icann2016.org/index.php/conference-programme/recipients-of-the-best-paper-awards/ ICANN 2016 | Recipients of the best paper awards]</ref>

* [~~https://www.linkedin.com/in/ian-goodfellow-b7187213~~ [Mathematician#IGoodfellow|Ian Goodfellow]], [~~https://en.wikipedia.org/wiki/Yoshua_Bengio~~ [Mathematician#YBengio|Yoshua Bengio]], [~~https://www.linkedin.com/in/aaron-courville-53a63459~~ [Mathematician#ACourville|Aaron Courville]] ('''2016'''). ''[http://www.deeplearningbook.org/ Deep Learning]''. [https://en.wikipedia.org/wiki/MIT_Press MIT Press]

* [[Max Jaderberg]], [[Volodymyr Mnih]], [[Wojciech Marian Czarnecki]], [[Tom Schaul]], [[Joel Z. Leibo]], [[David Silver]], [[Koray Kavukcuoglu]] ('''2016'''). ''Reinforcement Learning with Unsupervised Auxiliary Tasks''. [https://arxiv.org/abs/1611.05397v1 arXiv:1611.05397v1]

* [[Ian H. Witten]], [https://dblp.uni-trier.de/pers/hd/f/Frank:Eibe Eibe Frank], [https://dblp.uni-trier.de/pers/hd/h/Hall:Mark_A= Mark A. Hall], [http://www.professeurs.polymtl.ca/christopher.pal/ Christopher Pal] ('''2016'''). ''[https://www.cs.waikato.ac.nz/~ml/weka/book.html Data Mining: Practical Machine Learning Tools and Techniques]''. 4th Edition, [https://en.wikipedia.org/wiki/Morgan_Kaufmann_Publishers Morgan Kaufmann]

* [[Matthia Sabatelli]], [[Francesco Bidoia]], [[Valeriu Codreanu]], [[Marco Wiering]] ('''2018'''). ''Learning to Evaluate Chess Positions with Deep Neural Networks and Limited Lookahead''. ICPRAM 2018, [http://www.ai.rug.nl/~mwiering/GROUP/ARTICLES/ICPRAM_CHESS_DNN_2018.pdf pdf]

* [[Takeshi Ito]] ('''2018'''). ''Game learning support system based on future position''. [[CG 2018]], [[ICGA Journal#40_4|ICGA Journal, Vol. 40, No. 4]]

* [[Kristian Kersting]] ('''2018'''). ''[https://www.frontiersin.org/articles/10.3389/fdata.2018.00006/full Machine Learning and Artificial Intelligence: Two Fellow Travelers on the Quest for Intelligent Behavior in Machines]''. [https://www.frontiersin.org/journals/big-data# Frontiers in Big Data]

'''2019'''

* [[Herilalaina Rakotoarison]], [[Marc Schoenauer]], [[Michèle Sebag]] ('''2019'''). ''Automated Machine Learning with Monte-Carlo Tree Search''. [https://arxiv.org/abs/1906.00170 arXiv:1906.00170]

* [[Frank Hutter]], [https://dblp.org/pers/hd/k/Kotthoff:Lars Lars Kotthoff], [https://dblp.org/pers/hd/v/Vanschoren:Joaquin Joaquin Vanschoren] (eds.) ('''2019'''). ''[https://link.springer.com/book/10.1007%2F978-3-030-05318-5 Automated Machine Learning]''. [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer]

* [[Julian Schrittwieser]], [[Ioannis Antonoglou]], [[Thomas Hubert]], [[Karen Simonyan]], [[Laurent Sifre]], [[Simon Schmitt]], [[Arthur Guez]], [[Edward Lockhart]], [[Demis Hassabis]], [[Thore Graepel]], [[Timothy Lillicrap]], [[David Silver]] ('''2019'''). ''Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model''. [https://arxiv.org/abs/1911.08265 arXiv:1911.08265] <ref>[http://www.talkchess.com/forum3/viewtopic.php?f=2&t=72381 New DeepMind paper] by GregNeto, [[CCC]], November 21, 2019</ref>

==2020 ...==

* [[Julian Schrittwieser]], [[Ioannis Antonoglou]], [[Thomas Hubert]], [[Karen Simonyan]], [[Laurent Sifre]], [[Simon Schmitt]], [[Arthur Guez]], [[Edward Lockhart]], [[Demis Hassabis]], [[Thore Graepel]], [[Timothy Lillicrap]], [[David Silver]] ('''2020'''). ''[https://www.nature.com/articles/s41586-020-03051-4 Mastering Atari, Go, chess and shogi by planning with a learned model]''. [https://en.wikipedia.org/wiki/Nature_%28journal%29 Nature], Vol. 588 <ref>[https://deepmind.com/blog/article/muzero-mastering-go-chess-shogi-and-atari-without-rules?fbclid=IwAR3mSwrn1YXDKr9uuGm2GlFKh76wBilex7f8QvBiQecwiVmAvD6Bkyjx-rE MuZero: Mastering Go, chess, shogi and Atari without rules]</ref>

* [[Johannes Czech]], [[Moritz Willig]], [[Alena Beyer]], [[Kristian Kersting]], [[Johannes Fürnkranz]] ('''2020'''). ''[https://www.frontiersin.org/articles/10.3389/frai.2020.00024/full Learning to Play the Chess Variant Crazyhouse Above World Champion Level With Deep Neural Networks and Human Data]''. [https://www.frontiersin.org/journals/artificial-intelligence# Frontiers in Artificial Intelligence] » [[CrazyAra]]

=Forum Posts=

* [http://www.talkchess.com/forum/viewtopic.php?t=56313 Position learning and opening books] by Forrest Hoch, [[CCC]], May 11, 2015

* [http://www.talkchess.com/forum/viewtopic.php?t=61861 A database for learning evaluation functions] by [[Álvaro Begué]], [[CCC]], October 28, 2016 » [[Automated Tuning]], [[Evaluation]], [[Texel's Tuning Method]]

* [http://www.talkchess.com/forum3/viewtopic.php?f=7&t=69536 Interesting article in "Nature" on Machine Learning] by [[Kai Laskos]], [[CCC]], January 09, 2019

* [http://www.talkchess.com/forum3/viewtopic.php?f=7&t=72020 A book on machine learning] by Mehdi Amini, [[CCC]], October 06, 2019

GerdIsenberg

Bureaucrats, Administrators

25,161

edits

Changes

Learning

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools