Revision as of 14:05, 23 October 2018

Home * People * Michael L. Littman

Michael L. Littman ^[1]

Michael Lederman Littman,
an American mathematician, computer scientist and professor of CS at Brown University, and before at Rutgers University and Duke University. He received his Ph.D. in 1996 at Brown University on Algorithms for Sequential Decision Making under advisor Leslie P. Kaelbling ^[2] . His research interests focus on stochastic games and reinforcement learning along with the related Markov Models for decision making, such as Markov Chain, Hidden Markov Model (HMM), Markov Decision Process (MDP), and Partially Observable Markov Decision Process (POMDP).

Crossword Solver

During his time at Duke University in the late 90s, Michael L. Littman worked on an automated crossword solver Proverb ^[3] ^[4], which won an Outstanding Paper Award in 1999 from AAAI ^[5] ^[6] and competed in the American Crossword Puzzle Tournament ^[7] ^[8].

Markov Models

Michael Littman's explanatory grid on Markov Models ^[9] :

Markov Models		Do we have control over the state transitons?
Markov Models		NO	YES
Are the states completely observable?	YES	Markov Chain	MDP
Are the states completely observable?	NO	HMM	POMDP

Selected Publications

^[10]

1990 ...

Justin A. Boyan, Michael L. Littman (1993). Packet Routing in Dynamically Changing Networks: A Reinforcement Learning Approach. NIPS 1993, pdf
Michael L. Littman (1994). Markov Games as a Framework for Multi-Agent Reinforcement Learning. International Conference on Machine Learning, pdf
Michael L. Littman (1996). Algorithms for Sequential Decision Making . Ph.D. thesis, Brown University, pdf
Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore (1996). Reinforcement Learning: A Survey. JAIR, Vol. 4, pdf
Michael L. Littman (1997). Probabilistic Propositional Planning: Representations and Complexity. AAAI-1997
Michael L. Littman, Judy Goldsmith, Martin Mundhenk (1998). The Computational Complexity of Probabilistic Planning. JAIR, Vol. 9, pdf
Michael L. Littman (1999). Initial Experiments in Stochastic Satisfiability. [[Conferences#AAAI-99|AAAI-1999]
Michael L. Littman, Greg A. Keim, Noam M. Shazeer (1999). Solving Crosswords with PROVERB. AAAI-1999
Noam M. Shazeer, Michael L. Littman, Greg A. Keim (1999). Solving Crossword Puzzles as Probabilistic Constraint Satisfaction. AAAI-1999, CiteSeerX
Greg A. Keim, Noam M. Shazeer, Michael L. Littman, Sushant Agarwal, Catherine M. Cheves, Joseph Fitzgerald, Jason Grosland, Fan Jiang, Shannon Pollard, Karl Weinmeister (1999). PROVERB: The Probabilistic Cruciverbalist. AAAI-1999, pdf

2000 ...

Sebastian Thrun, Michael L. Littman (2000). A Review of Reinforcement Learning. AI Magazine, Vol. 21, No. 1, pdf
Michael L. Littman (2000). Review: Computer Language Games. CG 2000
Jiefu Shi, Michael L. Littman (2000). Abstraction Methods for Game Theoretic Poker. CG 2000
Michael L. Littman, Richard Sutton, Satinder Singh (2001). Predictive Representations of State. NIPS 2001, pdf
Michael L. Littman (2001). Value-function reinforcement learning in Markov games. Cognitive Systems Research, Vol. 2, pdf
Michael L. Littman, Greg A. Keim, Noam M. Shazeer (2002). A probabilistic approach to solving crossword puzzles. Artificial Intelligence, Vol. 134, pdf
Peter Stone, Robert Schapire, Michael L. Littman, János A. Csirik, David McAllester (2003). Decision-Theoretic Bidding Based on Learned Density Models in Simultaneous, Interacting Auctions. JAIR, Vol. 19, pdf
Michael L. Littman (2008). Autonomous Model Learning for Reinforcement Learning. QEST 2008

2010 ...

Thomas J. Walsh, Sergiu Goschin, Michael L. Littman (2010). Integrating sample-based planning and model-based reinforcement learning. AAAI, pdf » MCTS, UCT, Reinforcement Learning
Thomas J. Walsh, István Szita, Carlos Diuk, Michael L. Littman (2012). Exploring compact reinforcement-learning representations with linear regression. arXiv:1205.2606
Michael L. Littman (2012). Technical Perspective: A New Way to Search Game Trees. Communications of the ACM, Vol. 55, No. 3
Ari Weinstein, Michael L. Littman (2012). Bandit-Based Planning and Learning in Continuous-Action Markov Decision Processes. ICAPS 2012
Michael Kearns, Michael L. Littman, Satinder Singh (2013). Graphical Models for Game Theory. arXiv:1301.2281
Anthony R. Cassandra, Michael L. Littman, Nevin L. Zhang (2013). Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes. CoRR, February 2013, pdf
Ari Weinstein, Michael L. Littman, Sergiu Goschin (2013). Rollout-based Game-tree Search Outprunes Traditional Alpha-beta. PMLR, Vol. 24 » MCTS, UCT
Sergiu Goschin, Ari Weinstein, Michael L. Littman (2013). The Cross-Entropy Method Optimizes for Quantiles. ICML 2013
Ari Weinstein, Michael L. Littman (2013). Open-Loop Planning in Large-Scale Stochastic Domains. AAAI-2013

External Links

References

↑ Michael Littman at Rutgers University in 2009. This picture was taken by a student during a lecture, Michael L. Littman from Wikipedia
↑ Michael L. Littman (1996). Algorithms for Sequential Decision Making . Ph.D. thesis, Brown University, pdf
↑ Proverb: The Probabilistic Cruciverbalist
↑ Proverb (disambiguation) from Wikipedia
↑ AAAI Outstanding Paper Award:
↑ Greg A. Keim, Noam M. Shazeer, Michael L. Littman, Sushant Agarwal, Catherine M. Cheves, Joseph Fitzgerald, Jason Grosland, Fan Jiang, Shannon Pollard, Karl Weinmeister (1999). PROVERB: The Probabilistic Cruciverbalist. AAAI-1999, pdf
↑ Proverb, the Crossword-Solving Computer Program
↑ Michael L. Littman from Wikipedia
↑ The POMDP Page by Anthony R. Cassandra
↑ dblp: Michael L. Littman

Up one Level

[1] Michael Littman at Rutgers University in 2009. This picture was taken by a student during a lecture, Michael L. Littman from Wikipedia

[2] Michael L. Littman (1996). Algorithms for Sequential Decision Making . Ph.D. thesis, Brown University, pdf

[3] Proverb: The Probabilistic Cruciverbalist

[4] Proverb (disambiguation) from Wikipedia

[5] AAAI Outstanding Paper Award:

[6] Greg A. Keim, Noam M. Shazeer, Michael L. Littman, Sushant Agarwal, Catherine M. Cheves, Joseph Fitzgerald, Jason Grosland, Fan Jiang, Shannon Pollard, Karl Weinmeister (1999). PROVERB: The Probabilistic Cruciverbalist. AAAI-1999, pdf

[7] Proverb, the Crossword-Solving Computer Program

[8] Michael L. Littman from Wikipedia

[9] The POMDP Page by Anthony R. Cassandra

[10] : Michael L. Littman

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

@@ Line 58: / Line 58: @@
 ==2010 ...==
 * [[Thomas J. Walsh]], [[Sergiu Goschin]], [[Michael L. Littman]] ('''2010'''). ''Integrating sample-based planning and model-based reinforcement learning.'' [[AAAI]], [https://pdfs.semanticscholar.org/bdc9/bfb6ecc6fb5afb684df03d7220c46ebdbf4e.pdf pdf] » [[Monte-Carlo Tree Search|MCTS]], [[UCT]], [[Reinforcement Learning]]
+* [[Thomas J. Walsh]], [[István Szita]], [[Carlos Diuk]], [[Michael L. Littman]] ('''2012'''). ''Exploring compact reinforcement-learning representations with linear regression''. [https://arxiv.org/abs/1205.2606 arXiv:1205.2606]
 * [[Michael L. Littman]] ('''2012'''). ''Technical Perspective: A New Way to Search Game Trees''. [[ACM#Communications|Communications of the ACM]], Vol. 55, No. 3
 * [[Ari Weinstein]], [[Michael L. Littman]] ('''2012'''). ''[https://www.aaai.org/ocs/index.php/ICAPS/ICAPS12/paper/view/4697 Bandit-Based Planning and Learning in Continuous-Action Markov Decision Processes]''. [http://dblp.uni-trier.de/db/conf/aips/icaps2012.html ICAPS 2012]
-* [[Mathematician#MKearns|Michael Kearns]], [[Michael L. Littman]], [[Mathematician#SSingh|Satinder Singh]] ('''2013'''). ''Graphical Models for Game Theory''. [http://dblp.uni-trier.de/db/journals/corr/corr1301.html#abs-1301-2281 CoRR, January 2013], [https://www.cis.upenn.edu/~mkearns/papers/graphgames.pdf pdf]
+* [[Mathematician#MKearns|Michael Kearns]], [[Michael L. Littman]], [[Mathematician#SSingh|Satinder Singh]] ('''2013'''). ''Graphical Models for Game Theory''. [https://arxiv.org/abs/1301.2281 arXiv:1301.2281]
 * [[Anthony R. Cassandra]], [[Michael L. Littman]], [[Nevin L. Zhang]] ('''2013'''). ''Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes''. [http://dblp.uni-trier.de/db/journals/corr/corr1302.html#abs-1302-1525 CoRR, February 2013], [http://arxiv.org/ftp/arxiv/papers/1302/1302.1525.pdf pdf]
 * [[Ari Weinstein]], [[Michael L. Littman]], [[Sergiu Goschin]] ('''2013'''). ''[http://proceedings.mlr.press/v24/weinstein12a.html Rollout-based Game-tree Search Outprunes Traditional Alpha-beta]''. [http://proceedings.mlr.press/ PMLR], Vol. 24 » [[Monte-Carlo Tree Search|MCTS]], [[UCT]]

Difference between revisions of "Michael L. Littman"

Revision as of 14:05, 23 October 2018

Contents

Crossword Solver

See also

Markov Models

Selected Publications

1990 ...

2000 ...

2010 ...

External Links

References

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools