Changes

Learning

290 bytes added, 16:54, 4 July 2020

no edit summary

* [[Ross Quinlan]] ('''1979'''). ''Discovering Rules by Induction from Large Collections of Examples''. Expert Systems in the Micro-electronic Age, pp. 168-201. Edinburgh University Press (Introducing ID3)

==1980 ...==

* [[Sarah E. Goldin]], [http://www.linkedin.com/pub/phil-klahr/8/72b/676/de Philip Klahr] ('''1981'''). ''[http://dl.acm.org/citation.cfm?id=1623197 Learning and Abstraction in Simulation]''. [[Conferences#IJCAI1981|IJCAI 1981]], [http://ijcai.org/Past%20Proceedings/IJCAI-81-VOL%201/PDF/042.pdf pdf]

* [[Paul E. Utgoff]], [[Tom Mitchell]] ('''1982'''). ''Acquisition of Appropriate Bias for Inductive Concept Learning''. [http://dblp.uni-trier.de/db/conf/aaai/aaai82.html#UtgoffM82 AAAI 1982], [https://www.aaai.org/Papers/AAAI/1982/AAAI82-099.pdf pdf]

* [[A. Harry Klopf]] ('''1982'''). ''The Hedonistic Neuron: A Theory of Memory, Learning, and Intelligence''. Hemisphere Publishing Corporation, [[University of Michigan]]

==1985 ...==

* [[Tony Marsland]] ('''1985'''). ''Evaluation-Function Factors''. [[ICGA Journal#8_2|ICCA Journal, Vol. 8, No. 2]], [http://webdocs.cs.ualberta.ca/~tony/OldPapers/evaluation.pdf pdf]

* [[Albrecht Heeffer]] ('''1985'''). ''Validating Concepts from Automated Acquisition Systems''. [[Conferences#IJCAI1985|IJCAI 1985]], [http://ijcai.org/Past%20Proceedings/IJCAI-85-VOL1/PDF/118.pdf pdf]

* [[Hans Berliner]] ('''1985'''). ''Goals, Plans, and Mechanisms: Non-symbolically in an Evaluation Surface.'' Presentation at Evolution, Games, and Learning, Center for Nonlinear Studies, [[Los Alamos National Laboratory]], May 21.

* [[Ryszard Michalski]], [[Jaime Carbonell]], [[Tom Mitchell]] ('''1985, 2014'''). ''[https://www.elsevier.com/books/machine-learning/michalski/978-0-08-051054-5?gclid=EAIaIQobChMItc_hsp_34AIVUeR3Ch2l9QcDEAYYASABEgKW4_D_BwE Machine Learning: An Artificial Intelligence Approach, Volume I]''. [https://en.wikipedia.org/wiki/Morgan_Kaufmann_Publishers Morgan Kaufmann]

* [[Gerald Tesauro]], [[Terrence J. Sejnowski]] ('''1987'''). ''A 'Neural' Network that Learns to Play Backgammon''. [http://www.informatik.uni-trier.de/~ley/db/conf/nips/nips1987.html#TesauroS87 NIPS 1987]

* [[Alen Shapiro]] ('''1987'''). ''Structured Induction in Expert Systems''. Turing Institute Press in association with Addison-Wesley Publishing Company, Wokingham, UK

* [[Alberto Maria Segre]] ('''1987'''). ''On the Operationality/Generality Trade-off in Explanation-based Learning''. [[Conferences#IJCAI1987|IJCAI 1987]], [http://ijcai.org/Past%20Proceedings/IJCAI-87-VOL1/PDF/049.pdf pdf]

* [[Alberto Maria Segre]] ('''1987'''). ''Explanation-Based Learning of Generalized Robot Assembly Plans''. Ph.D. thesis, [[University of Illinois at Urbana-Champaign]], Advisor: [http://www.ece.illinois.edu/directory/profile.asp?mrebl Gerald Francis DeJong, II]

* [[Eric B. Baum]], [https://en.wikipedia.org/wiki/Frank_Wilczek Frank Wilczek] ('''1987'''). ''[http://papers.nips.cc/paper/3-supervised-learning-of-probability-distributions-by-neural-networks Supervised Learning of Probability Distributions by Neural Networks]''. [http://papers.nips.cc/book/neural-information-processing-systems-1987 NIPS 1987]

* [[Bruce Abramson]] ('''1988'''). ''Learning Expected-Outcome Evaluators in Chess.'' Proceedings of the 1988 AAAI Spring Symposium Series: Computer Game Playing, 26-28.

* [[Richard Sutton]] ('''1988'''). ''Learning to Predict by the Methods of Temporal Differences''. [https://en.wikipedia.org/wiki/Machine_Learning_%28journal%29 Machine Learning], Vol. 3, No. 1, [https://webdocs.cs.ualberta.ca/~sutton/papers/sutton-88-with-erratum.pdf pdf]

* [[David E. Goldberg]], [[Mathematician#Holland|John H. Holland]] ('''1988'''). ''[https://link.springer.com/article/10.1023/A:1022602019183 Genetic Algorithms and Machine Learning]''. [https://www.springer.com/journal/10994 Machine Learning], Vol. 3

* [[Mathematician#KADeJong|Kenneth A. De Jong]], [[Mathematician#ACSchultz|Alan C. Schultz]] ('''1988'''). ''Using Experience-Based Learning in Game Playing''. Proceedings of the Fifth International Machine Learning Conference, [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.52.5381 CiteSeerX] » [[Othello]]

* [[Kai-Fu Lee]], [[Sanjoy Mahajan]] ('''1988'''). ''[http://www.sciencedirect.com/science/article/pii/0004370288900768 A Pattern Classification Approach to Evaluation Function Learning]''. [https://en.wikipedia.org/wiki/Artificial_Intelligence_%28journal%29 Artificial Intelligence], Vol. 36, No. 1

* [[Paul E. Utgoff]] ('''1988'''). ''[http://dl.acm.org/citation.cfm?id=896712 ID5: An incremental ID3]''. [http://dblp.uni-trier.de/db/conf/icml/ml1988.html#Utgoff88 ML 1988]

* [[Shaul Markovitch]], [[Mathematician#PDScott|Paul D. Scott]] ('''1988'''). ''[https://www.semanticscholar.org/paper/The-Role-of-Forgetting-in-Learning-Markovitch-Scott/adbd75db1f85dd3545b4d6b8bba509bf20d7bfce The Role of Forgetting in Learning]''. [https://dblp.uni-trier.de/db/conf/icml/ml1988.html ML 1988], [http://www.cs.technion.ac.il/~shaulm/papers/pdf/Markovitch-Scott-icml1988.pdf pdf]

'''1989'''

* [[David E. Goldberg]] ('''1989'''). ''Genetic Algorithms in Search, Optimization and Machine Learning''. [https://en.wikipedia.org/wiki/Addison-Wesley Addison-Wesley]

* [[Robert Levinson]] ('''1989'''). ''A Self-Learning, Pattern-Oriented Chess Program''. [[ICGA Journal#12_4|ICCA Journal, Vol. 12, No. 4]]

* [[Bruce Abramson]] ('''1989'''). ''On Learning and Testing Evaluation Functions.'' Proceedings of the Sixth Israeli Conference on Artificial Intelligence, 1989, 7-16.

'''1993'''

* [[Michael Gherrity]] ('''1993'''). ''A Game Learning Machine''. Ph.D. Thesis, [http://de.wikipedia.org/wiki/University_of_California,_San_Diego University of California, San Diego], [http://www.gherrity.org/thesis.ps.gz zipped ps]

* [[Shaul Markovitch]], [[Yaron Sella]] ('''1993'''). ''[https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1467-8640.1996.tb00254.x Learning of Resource Allocation Strategies for Game Playing]''. [[Conferences#IJCAI1993|IJCAI 1993]], [https://www.ijcai.org/Proceedings/93-2/Papers/020.pdf pdf]

* [[Mathematician#DCarmel|David Carmel]], [[Shaul Markovitch]] ('''1993'''). ''[https://aaai.org/Library/Symposia/Fall/1993/fs93-02-019.php Learning Models of Opponent's Strategy in Game Playing]''. [[Conferences#AAAI-93|AAAI 1993]], FS-93-02, [https://www.aaai.org/Papers/Symposia/Fall/1993/FS-93-02/FS93-02-019.pdf pdf]

* [[Dan Geiger]], [[Azaria Paz]], [[Judea Pearl]] ('''1993'''). ''Learning simple causal structures''. [http://onlinelibrary.wiley.com/journal/10.1002/%28ISSN%291098-111X International Journal of Intelligent Systems], Vol. 8

* [[Sebastian Thrun]], [[Tom Mitchell]] ('''1993'''). ''Integrating Inductive Neural Network Learning and Explanation-Based Learning''. [[Conferences#IJCAI1993|IJCAI 1993]], [http://robots.stanford.edu/papers/thrun.EBNN_ijcai93.ps.gz zipped ps]

* [[Alois Heinz]], [[Christoph Hense]] ('''1993'''). ''[http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.56.872 Bootstrap learning of α-β-evaluation functions]''. [http://dblp.uni-trier.de/db/conf/icci/icci1993.html#HeinzH93 ICCI 1993], [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.56.872&rep=rep1&type=pdf pdf]

* [[Don Beal]], [[Martin C. Smith]] ('''1997'''). ''Learning Piece Values Using Temporal Differences''. [[ICGA Journal#20_3|ICCA Journal, Vol. 20, No. 3]]

* [[Kieran Greer]], [[Piyush Ojha]], [[David A. Bell]] ('''1997'''). ''Learning Search Heuristics from Examples: A Study in Computer Chess'', Seventh Conference of the Spanish Association for Artificial Intelligence, CAEPIA’97, November, pp. 695-704.

* [[Nir Friedman]], [[Moises Goldszmidt]], [[David Heckerman]], [[Stuart Russell]] ('''1997'''). ''Where is the Impact of Bayesian Networks in Learning?'' [[Conferences#IJCAI1997|IJCAI 1997]], [http://www.cs.berkeley.edu/~russell/papers/ijcai97-challenge.ps ps]

* [[Ronald Parr]], [[Stuart Russell]] ('''1997'''). ''Reinforcement Learning with Hierarchies of Machines.'' In Advances in Neural Information Processing Systems 10, MIT Press, [http://www.cs.berkeley.edu/~russell/papers/nips97-ham.ps.gz zipped ps]

* [[Tristan Cazenave]] ('''1997'''). ''Gogol (an Analytical Learning Program)''. [[Conferences#IJCAI1997|IJCAI 1997]], [http://www.lamsade.dauphine.fr/~cazenave/papers/fost97.pdf pdf]

* [[Tom Mitchell]] ('''1997'''). ''[http://www.cs.cmu.edu/%7Etom/mlbook.html Machine Learning]''. [https://en.wikipedia.org/wiki/McGraw-Hill McGraw Hill]

* [[Michèle Sebag]] ('''1997'''). ''Stochastic Heuristics for Machine Learning & Machine Learning for Stochastic Optimization''. Habilitation, [https://en.wikipedia.org/wiki/Paris-Sud_11_University Paris-Sud 11 University]

* [[David Heckerman]] ('''1999'''). ''A tutorial on learning with Bayesian networks''. [http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=F04616607A620324B33D40A8ABB702CB?doi=10.1.1.15.4522&rep=rep1&type=pdf pdf] from [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.15.4522 CiteSeerX]

* [http://www2.lifl.fr/%7Edecomite/ F. De Comité], [http://www.lif.univ-mrs.fr/%7Efdenis/ F. Denis], [http://www.grappa.univ-lille3.fr/%7Egilleron/ R. Gilleron], [[Fabien Letouzey]] ('''1999'''). ''Positive and Unlabeled Examples help Learning''. The 10th International Conference on Algorithmic Learning Theory, [http://www.cmi.univ-mrs.fr/%7Efdenis/alt99.ps ps]

* [http://www.ilsp.gr/homepages/papavasiliou_eng.html Vassilis Papavassiliou], [[Stuart Russell]] ('''1999'''). ''Convergence of reinforcement learning with general function approximators.'' [[Conferences#IJCAI1999|IJCAI 1999]], [http://www.cs.berkeley.edu/~russell/papers/ijcai99-bridge.ps ps]

* [[Philip G. K. Reiser]], [[Patricia J. Riddle]] ('''1999'''). ''[http://link.springer.com/chapter/10.1007%2F3-540-48873-1_19 Evolving Logic Programs to Classify Chess-Endgame Positions]''. [http://link.springer.com/book/10.1007%2F3-540-48873-1 Simulated Evolution and Learning], [https://en.wikipedia.org/wiki/Canberra Canberra], Australia. [http://www.springer.com/series/1244 Lecture Notes in Artificial Intelligence], No. 1585, [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer], [http://stancomb.co.uk/Papers/seal98.pdf pdf] » [[Endgame]]

* [[Marco Wiering]] ('''1999'''). ''Explorations in Efficient Reinforcement Learning''. Ph.D. thesis, [https://en.wikipedia.org/wiki/University_of_Amsterdam University of Amsterdam], advisors [[Mathematician#FGroen|Frans Groen]] and [[Jürgen Schmidhuber]]

'''2001'''

* [[Nicol N. Schraudolph]], [[Peter Dayan]], [[Terrence J. Sejnowski]] ('''2001'''). ''[http://nic.schraudolph.org/bib2html/b2hd-SchDaySej01.html Learning to Evaluate Go Positions via Temporal Difference Methods]''. [http://jasss.soc.surrey.ac.uk/7/1/reviews/takama.html Computational Intelligence in Games, Studies in Fuzziness and Soft Computing], [http://www.springer.com/economics?SGWID=1-165-6-73481-0 Physica-Verlag], revised version of [[Nicol N. Schraudolph#1993|1993 paper]]

* [[Jonathan Schaeffer]], [[Markian Hlynka]], [[Vili Jussila]] ('''2001'''). ''Temporal Difference Learning Applied to a High-Performance Game-Playing Program''. [[Conferences#IJCAI2001|IJCAI 2001]]

* [[Michael Bowling]], [[Manuela Veloso|Manuela M. Veloso]] ('''2001'''). ''Rational and Convergent Learning in Stochastic Games''. [[Conferences#IJCAI2001|IJCAI 2001]]

* [[Levente Kocsis]], [[Jos Uiterwijk]], [[Jaap van den Herik]] ('''2001'''). ''Move Ordering using Neural Networks'', IEA/AIE 2001, LNCS 2070, [http://zaphod.aml.sztaki.hu/papers/kocsis-IEA01.ps ps]

* [[Marty Hirsch]] ('''2001'''). ''Machine Learning in MChess Professional''. [[Advances in Computer Games 9]]

* [[Yngvi Björnsson]], [[Tony Marsland]] ('''2001'''). ''Learning Search Control in Adversary Games''. [[Advances in Computer Games 9]], pp. 157-174. [http://www.ru.is/faculty/yngvi/pdf/BjornssonM01b.pdf pdf]

* [[Jean Hayes Michie]] ('''2001'''). ''[http://www.aaai.org/ojs/index.php/aimagazine/article/view/1599/0 Machine Learning and Light Relief: A Review of Truth from Trash]''. [http://www.informatik.uni-trier.de/~ley/db/journals/aim/aim22.html#Michie01 AI Magazine Vol. 22 No. 4], [http://www.aaai.org/ojs/index.php/aimagazine/article/download/1599/1498 pdf]

* [[Pieter Spronck]], [[Ida Sprinkhuizen-Kuyper]], [[Eric Postma]] ('''2001'''). ''Infused Evolutionary Learning''. Proceedings of the Eleventh Belgian-Dutch Conference on Machine Learning, [http://www.cnts.ua.ac.be/benelearn2001/proceedings/bene01-spronck.pdf pdf], [http://ticc.uvt.nl/~pspronck/pubs/InfusedEvolutionaryLearning.pdf pdf]

* [[Charles Elkan]] ('''2001'''). ''The Foundations of Cost-Sensitive Learning''. [[Conferences#IJCAI2001|IJCAI 2001]]

* [[Alex B. Meijer]], [[Henk Koppelaar]] ('''2001'''). ''[http://www.kbs.twi.tudelft.nl/Publications/Conference/2001/2001-MeijerKoppelaar-GAMEON01.html A learning architecture for the game of Go]''. [https://www.informs.org/Attend-a-Conference/Conference-Calendar/Game-On-2001 Game-On 2001]

* [[Johannes Fürnkranz]], [[Miroslav Kubat]] ('''2001'''). ''[https://www.novapublishers.com/catalog/product_info.php?products_id=720 Machines that Learn to Play Games]''. Advances in Computation: Theory and Practice, Vol. 8. [https://en.wikipedia.org/wiki/Nova_Publishers NOVA Science Publishers]

* [[Mathematician#ROrtner|Ronald Ortner]], [[Mathematician#DRyabko|Daniil Ryabko]], [[Peter Auer]], [[Rémi Munos]] ('''2014'''). ''Regret bounds for restless Markov bandits''. [https://en.wikipedia.org/wiki/Theoretical_Computer_Science_%28journal%29 Theoretical Computer Science] 558, [http://daniil.ryabko.net/mabajr.pdf pdf]

==2015 ...==

* [[Volodymyr Mnih]], [[Koray Kavukcuoglu]], [[David Silver]], [[Mathematician#AARusu|Andrei A. Rusu]], [[Joel Veness]], [[Marc G. Bellemare]], [[Alex Graves]], [[Martin Riedmiller]], [[Andreas K. Fidjeland]], [[Georg Ostrovski]], [[Stig Petersen]], [[Charles Beattie]], [[Amir Sadik]], [[Ioannis Antonoglou]], [[Helen King]], [[Dharshan Kumaran]], [[Daan Wierstra]], [[Shane Legg]], [[Demis Hassabis]] ('''2015'''). ''[http://www.nature.com/nature/journal/v518/n7540/abs/nature14236.html Human-level control through deep reinforcement learning]''. [https://en.wikipedia.org/wiki/Nature_%28journal%29 Nature], Vol. 518

* [[Tobias Graf]], [[Marco Platzner]] ('''2015'''). ''Adaptive Playouts in Monte Carlo Tree Search with Policy Gradient Reinforcement Learning''. [[Advances in Computer Games 14]]

* [[Yuichiro Sato]], [[Hiroyuki Iida]], [[Jaap van den Herik]] ('''2015'''). ''Transfer Learning by Inductive Logic Programming''. [[Advances in Computer Games 14]]

* [[Jialin Liu]], [[Olivier Teytaud]], [[Tristan Cazenave]] ('''2016'''). ''Fast seed-learning algorithms for games''. [[CG 2016]]

* [[Eli David|Omid E. David]], [[Nathan S. Netanyahu]], [[Lior Wolf]] ('''2016'''). ''[http://link.springer.com/chapter/10.1007%2F978-3-319-44781-0_11 DeepChess: End-to-End Deep Neural Network for Automatic Learning in Chess]''. [http://icann2016.org/ ICANN 2016], [https://en.wikipedia.org/wiki/Lecture_Notes_in_Computer_Science Lecture Notes in Computer Science], Vol. 9887, [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer], [http://www.cs.tau.ac.il/~wolf/papers/deepchess.pdf pdf preprint] » [[DeepChess]] <ref>[http://www.talkchess.com/forum/viewtopic.php?t=61748 DeepChess: Another deep-learning based chess program] by [[Matthew Lai]], [[CCC]], October 17, 2016</ref> <ref>[http://icann2016.org/index.php/conference-programme/recipients-of-the-best-paper-awards/ ICANN 2016 | Recipients of the best paper awards]</ref>

* [[Mathematician#IGoodfellow|Ian Goodfellow]], [[Mathematician#YBengio|Yoshua Bengio]], [[Mathematician#ACourville|Aaron Courville]] ('''2016'''). ''[http://www.deeplearningbook.org/ Deep Learning]''. [https://en.wikipedia.org/wiki/MIT_Press MIT Press]

* [[Max Jaderberg]], [[Volodymyr Mnih]], [[Wojciech Marian Czarnecki]], [[Tom Schaul]], [[Joel Z. Leibo]], [[David Silver]], [[Koray Kavukcuoglu]] ('''2016'''). ''Reinforcement Learning with Unsupervised Auxiliary Tasks''. [https://arxiv.org/abs/1611.05397v1 arXiv:1611.05397v1]

* [[Ian H. Witten]], [https://dblp.uni-trier.de/pers/hd/f/Frank:Eibe Eibe Frank], [https://dblp.uni-trier.de/pers/hd/h/Hall:Mark_A= Mark A. Hall], [http://www.professeurs.polymtl.ca/christopher.pal/ Christopher Pal] ('''2016'''). ''[https://www.cs.waikato.ac.nz/~ml/weka/book.html Data Mining: Practical Machine Learning Tools and Techniques]''. 4th Edition, [https://en.wikipedia.org/wiki/Morgan_Kaufmann_Publishers Morgan Kaufmann]

GerdIsenberg
