

Michael L. Littman

810 bytes added, 17:32, 23 October 2018
* [[Michael L. Littman]] ('''1996'''). ''Algorithms for Sequential Decision Making''. Ph.D. thesis, [https://en.wikipedia.org/wiki/Brown_University Brown University], [http://www.cs.rutgers.edu/~mlittman/papers/thesis-with-gammas.pdf pdf]
* [[Mathematician#LPKaelbling|Leslie Pack Kaelbling]], [[Michael L. Littman]], [[Mathematician#AWMoore|Andrew W. Moore]] ('''1996'''). ''[http://www.cs.washington.edu/research/jair/volume4/kaelbling96a-html/rl-survey.html Reinforcement Learning: A Survey]''. [http://www.jair.org/vol/vol4.html JAIR, Vol. 4], [http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume4/kaelbling96a.pdf pdf]
* [[Michael L. Littman]] ('''1997'''). ''Probabilistic Propositional Planning: Representations and Complexity''. [[Conferences#AAAI-97|AAAI-1997]]
* [[Michael L. Littman]], [[Mathematician#JGoldsmith|Judy Goldsmith]], [[Mathematician#MMundhenk|Martin Mundhenk]] ('''1998'''). ''The Computational Complexity of Probabilistic Planning''. [http://dblp.uni-trier.de/db/journals/jair/jair9.html#LittmanGM98 JAIR, Vol. 9], [http://arxiv.org/pdf/cs/9808101.pdf pdf]
* [[Michael L. Littman]] ('''1999'''). ''Initial Experiments in Stochastic Satisfiability''. [[Conferences#AAAI-99|AAAI-1999]]
==2010 ...==
* [[Thomas J. Walsh]], [[Sergiu Goschin]], [[Michael L. Littman]] ('''2010'''). ''Integrating sample-based planning and model-based reinforcement learning''. [[AAAI]], [https://pdfs.semanticscholar.org/bdc9/bfb6ecc6fb5afb684df03d7220c46ebdbf4e.pdf pdf] » [[Monte-Carlo Tree Search|MCTS]], [[UCT]], [[Reinforcement Learning]]
* [[Thomas J. Walsh]], [[István Szita]], [[Carlos Diuk]], [[Michael L. Littman]] ('''2012'''). ''Exploring compact reinforcement-learning representations with linear regression''. [https://arxiv.org/abs/1205.2606 arXiv:1205.2606]
* [[Michael L. Littman]], [https://dblp.uni-trier.de/pers/hd/r/Ravi:Nishkam Nishkam Ravi], [https://dblp.uni-trier.de/pers/hd/t/Talwar:Arjun Arjun Talwar], [[Martin Zinkevich]] ('''2012'''). ''An Efficient Optimal-Equilibrium Algorithm for Two-player Game Trees''. [https://arxiv.org/abs/1206.6855 arXiv:1206.6855]
* [[Michael L. Littman]] ('''2012'''). ''Technical Perspective: A New Way to Search Game Trees''. [[ACM#Communications|Communications of the ACM]], Vol. 55, No. 3
* [[Ari Weinstein]], [[Michael L. Littman]] ('''2012'''). ''[https://www.aaai.org/ocs/index.php/ICAPS/ICAPS12/paper/view/4697 Bandit-Based Planning and Learning in Continuous-Action Markov Decision Processes]''. [http://dblp.uni-trier.de/db/conf/aips/icaps2012.html ICAPS 2012]
* [[Mathematician#MKearns|Michael Kearns]], [[Michael L. Littman]], [[Mathematician#SSingh|Satinder Singh]] ('''2013'''). ''Graphical Models for Game Theory''. [https://arxiv.org/abs/1301.2281 arXiv:1301.2281]
* [[Anthony R. Cassandra]], [[Michael L. Littman]], [[Nevin L. Zhang]] ('''2013'''). ''Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes''. [https://arxiv.org/abs/1302.1525 arXiv:1302.1525]
* [[Ari Weinstein]], [[Michael L. Littman]], [[Sergiu Goschin]] ('''2013'''). ''[http://proceedings.mlr.press/v24/weinstein12a.html Rollout-based Game-tree Search Outprunes Traditional Alpha-beta]''. [http://proceedings.mlr.press/ PMLR], Vol. 24 » [[Monte-Carlo Tree Search|MCTS]], [[UCT]]
* [[Sergiu Goschin]], [[Ari Weinstein]], [[Michael L. Littman]] ('''2013'''). ''The Cross-Entropy Method Optimizes for Quantiles''. [http://dblp.uni-trier.de/db/conf/icml/icml2013.html ICML 2013]
* [[Ari Weinstein]], [[Michael L. Littman]] ('''2013'''). ''Open-Loop Planning in Large-Scale Stochastic Domains''. [[Conferences#AAAI-2013|AAAI-2013]]
* [https://dblp.uni-trier.de/pers/hd/a/Asadi:Kavosh Kavosh Asadi], [[Michael L. Littman]] ('''2016'''). ''An Alternative Softmax Operator for Reinforcement Learning''. [https://arxiv.org/abs/1612.05628 arXiv:1612.05628]
* [https://dblp.uni-trier.de/pers/hd/l/Lehnert:Lucas Lucas Lehnert], [[Michael L. Littman]] ('''2018'''). ''Transfer with Model Features in Reinforcement Learning''. [https://arxiv.org/abs/1807.01736 arXiv:1807.01736]
=External Links=
'''[[People|Up one Level]]'''
[[Category:Researcher|Littman]]
