Changes

← Older edit

Michael L. Littman

809 bytes added, 17:32, 23 October 2018

no edit summary

==2010 ...==

* [[Thomas J. Walsh]], [[Sergiu Goschin]], [[Michael L. Littman]] ('''2010'''). ''Integrating sample-based planning and model-based reinforcement learning.'' [[AAAI]], [https://pdfs.semanticscholar.org/bdc9/bfb6ecc6fb5afb684df03d7220c46ebdbf4e.pdf pdf] » [[Monte-Carlo Tree Search|MCTS]], [[UCT]], [[Reinforcement Learning]]

* [[Thomas J. Walsh]], [[István Szita]], [[Carlos Diuk]], [[Michael L. Littman]] ('''2012'''). ''Exploring compact reinforcement-learning representations with linear regression''. [https://arxiv.org/abs/1205.2606 arXiv:1205.2606]

* [[Michael L. Littman]], [https://dblp.uni-trier.de/pers/hd/r/Ravi:Nishkam Nishkam Ravi], [https://dblp.uni-trier.de/pers/hd/t/Talwar:Arjun Arjun Talwar], [[Martin Zinkevich]] ('''2012'''). ''An Efficient Optimal-Equilibrium Algorithm for Two-player Game Trees''. [https://arxiv.org/abs/1206.6855 arXiv:1206.6855]

* [[Michael L. Littman]] ('''2012'''). ''Technical Perspective: A New Way to Search Game Trees''. [[ACM#Communications|Communications of the ACM]], Vol. 55, No. 3

* [[Ari Weinstein]], [[Michael L. Littman]] ('''2012'''). ''[https://www.aaai.org/ocs/index.php/ICAPS/ICAPS12/paper/view/4697 Bandit-Based Planning and Learning in Continuous-Action Markov Decision Processes]''. [http://dblp.uni-trier.de/db/conf/aips/icaps2012.html ICAPS 2012]

* [[Mathematician#MKearns|Michael Kearns]], [[Michael L. Littman]], [[Mathematician#SSingh|Satinder Singh]] ('''2013'''). ''Graphical Models for Game Theory''. [~~http~~https://~~dblp~~arxiv.~~uni-trier.de~~org/dbabs/~~journals/corr/corr1301~~1301.~~html#abs-1301-~~2281 ~~CoRR, January 2013], [https~~arXiv:~~//www~~1301.~~cis.upenn.edu/~mkearns/papers/graphgames.pdf pdf~~2281]* [[Anthony R. Cassandra]], [[Michael L. Littman]], [[Nevin L. Zhang]] ('''2013'''). ''Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes''. [~~http~~https://~~dblp~~arxiv.~~uni-trier.de~~org/dbabs/~~journals/corr/corr1302~~1302.~~html#abs-1302-~~1525 ~~CoRR, February 2013], [http~~arXiv:~~//arxiv.org/ftp/arxiv/papers/1302/~~1302.1525~~.pdf pdf~~]

* [[Ari Weinstein]], [[Michael L. Littman]], [[Sergiu Goschin]] ('''2013'''). ''[http://proceedings.mlr.press/v24/weinstein12a.html Rollout-based Game-tree Search Outprunes Traditional Alpha-beta]''. [http://proceedings.mlr.press/ PMLR], Vol. 24 » [[Monte-Carlo Tree Search|MCTS]], [[UCT]]

* [[Sergiu Goschin]], [[Ari Weinstein]], [[Michael L. Littman]] ('''2013'''). ''The Cross-Entropy Method Optimizes for Quantiles''. [http://dblp.uni-trier.de/db/conf/icml/icml2013.html ICML 2013]

* [[Ari Weinstein]], [[Michael L. Littman]] ('''2013'''). ''Open-Loop Planning in Large-Scale Stochastic Domains''. [[Conferences#AAAI-2013|AAAI-2013]]

* [https://dblp.uni-trier.de/pers/hd/a/Asadi:Kavosh Kavosh Asadi], [[Michael L. Littman]] ('''2016'''). ''An Alternative Softmax Operator for Reinforcement Learning''. [https://arxiv.org/abs/1612.05628 arXiv:1612.05628]

* [https://dblp.uni-trier.de/pers/hd/l/Lehnert:Lucas Lucas Lehnert], [[Michael L. Littman]] ('''2018'''). ''Transfer with Model Features in Reinforcement Learning''. [https://arxiv.org/abs/1807.01736 arXiv:1807.01736]

=External Links=

'''[[People|Up one Level]]'''

[[Category:Researcher|Littman]]

GerdIsenberg

Bureaucrats, Administrators

25,161

edits

Changes

Michael L. Littman

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools