Revision as of 19:09, 8 June 2018

Michael L. Littman ^[1]

Michael Lederman Littman is an American mathematician, computer scientist, and professor of computer science at Brown University, previously at Rutgers University and Duke University. He received his Ph.D. from Brown University in 1996 with the thesis Algorithms for Sequential Decision Making, advised by Leslie P. Kaelbling ^[2]. His research interests focus on stochastic games and reinforcement learning, along with the related Markov models for decision making, such as the Markov Chain, Hidden Markov Model (HMM), Markov Decision Process (MDP), and Partially Observable Markov Decision Process (POMDP).

Crossword Solver

During his time at Duke University in the late 1990s, Michael L. Littman worked on the automated crossword solver Proverb ^[3] ^[4], which won an Outstanding Paper Award from AAAI in 1999 ^[5] ^[6] and competed in the American Crossword Puzzle Tournament ^[7] ^[8].

Markov Models

Michael Littman's explanatory grid on Markov Models ^[9] :

Markov Models	Control over the state transitions? NO	Control over the state transitions? YES
States completely observable? YES	Markov Chain	MDP
States completely observable? NO	HMM	POMDP
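
The grid can be read as a lookup keyed on its two yes/no questions. The following is a minimal Python sketch of that reading; the helper name `classify_markov_model` and its boolean parameters are assumptions made for illustration and do not come from the cited page.

```python
def classify_markov_model(fully_observable: bool, controllable: bool) -> str:
    """Return the model family for one cell of the grid.

    Hypothetical helper: the name and signature are illustrative
    assumptions, not taken from the cited source.
    """
    # Keys are (states completely observable?, control over transitions?).
    grid = {
        (True, False): "Markov Chain",
        (True, True): "MDP",
        (False, False): "HMM",
        (False, True): "POMDP",
    }
    return grid[(fully_observable, controllable)]


if __name__ == "__main__":
    # Print all four cells in the same order as the grid rows.
    for observable in (True, False):
        for controllable in (False, True):
            print(observable, controllable,
                  classify_markov_model(observable, controllable))
```

For example, a setting in which the agent chooses actions but sees only noisy observations of the state corresponds to `classify_markov_model(False, True)`, the POMDP cell.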

Selected Publications

^[10]

1990 ...

Justin A. Boyan, Michael L. Littman (1993). Packet Routing in Dynamically Changing Networks: A Reinforcement Learning Approach. NIPS 1993, pdf
Michael L. Littman (1994). Markov Games as a Framework for Multi-Agent Reinforcement Learning. International Conference on Machine Learning, pdf
Michael L. Littman (1996). Algorithms for Sequential Decision Making. Ph.D. thesis, Brown University, pdf
Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore (1996). Reinforcement Learning: A Survey. JAIR, Vol. 4, pdf
Michael L. Littman (1997). Probabilistic Propositional Planning: Representations and Complexity. AAAI-1997
Michael L. Littman, Judy Goldsmith, Martin Mundhenk (1998). The Computational Complexity of Probabilistic Planning. JAIR, Vol. 9, pdf
Michael L. Littman (1999). Initial Experiments in Stochastic Satisfiability. AAAI-1999
Michael L. Littman, Greg A. Keim, Noam M. Shazeer (1999). Solving Crosswords with PROVERB. AAAI-1999
Noam M. Shazeer, Michael L. Littman, Greg A. Keim (1999). Solving Crossword Puzzles as Probabilistic Constraint Satisfaction. AAAI-1999, CiteSeerX
Greg A. Keim, Noam M. Shazeer, Michael L. Littman, Sushant Agarwal, Catherine M. Cheves, Joseph Fitzgerald, Jason Grosland, Fan Jiang, Shannon Pollard, Karl Weinmeister (1999). PROVERB: The Probabilistic Cruciverbalist. AAAI-1999, pdf

2000 ...

Sebastian Thrun, Michael L. Littman (2000). A Review of Reinforcement Learning. AI Magazine, Vol. 21, No. 1, pdf
Michael L. Littman (2000). Review: Computer Language Games. CG 2000
Jiefu Shi, Michael L. Littman (2000). Abstraction Methods for Game Theoretic Poker. CG 2000
Michael L. Littman, Richard Sutton, Satinder Singh (2001). Predictive Representations of State. NIPS 2001, pdf
Michael L. Littman (2001). Value-function reinforcement learning in Markov games. Cognitive Systems Research, Vol. 2, pdf
Michael L. Littman, Greg A. Keim, Noam M. Shazeer (2002). A probabilistic approach to solving crossword puzzles. Artificial Intelligence, Vol. 134, pdf
Peter Stone, Robert Schapire, Michael L. Littman, János A. Csirik, David McAllester (2003). Decision-Theoretic Bidding Based on Learned Density Models in Simultaneous, Interacting Auctions. JAIR, Vol. 19, pdf
Michael L. Littman (2008). Autonomous Model Learning for Reinforcement Learning. QEST 2008

2010 ...

Thomas J. Walsh, Sergiu Goschin, Michael L. Littman (2010). Integrating sample-based planning and model-based reinforcement learning. AAAI, pdf » MCTS, UCT, Reinforcement Learning
Michael L. Littman (2012). Technical Perspective: A New Way to Search Game Trees. Communications of the ACM, Vol. 55, No. 3
Ari Weinstein, Michael L. Littman (2012). Bandit-Based Planning and Learning in Continuous-Action Markov Decision Processes. ICAPS 2012
Michael Kearns, Michael L. Littman, Satinder Singh (2013). Graphical Models for Game Theory. CoRR, January 2013, pdf
Anthony R. Cassandra, Michael L. Littman, Nevin L. Zhang (2013). Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes. CoRR, February 2013, pdf
Ari Weinstein, Michael L. Littman, Sergiu Goschin (2013). Rollout-based Game-tree Search Outprunes Traditional Alpha-beta. PMLR, Vol. 24 » MCTS, UCT
Sergiu Goschin, Ari Weinstein, Michael L. Littman (2013). The Cross-Entropy Method Optimizes for Quantiles. ICML 2013
Ari Weinstein, Michael L. Littman (2013). Open-Loop Planning in Large-Scale Stochastic Domains. AAAI-2013

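External Links

Michael Littman's Home Page
Michael L. Littman from Wikipedia
The Mathematics Genealogy Project - Michael Littman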

References

[1] Michael Littman at Rutgers University in 2009. This picture was taken by a student during a lecture, Michael L. Littman from Wikipedia
[2] Michael L. Littman (1996). Algorithms for Sequential Decision Making. Ph.D. thesis, Brown University, pdf
[3] Proverb: The Probabilistic Cruciverbalist
[4] Proverb (disambiguation) from Wikipedia
[5] AAAI Outstanding Paper Award
[6] Greg A. Keim, Noam M. Shazeer, Michael L. Littman, Sushant Agarwal, Catherine M. Cheves, Joseph Fitzgerald, Jason Grosland, Fan Jiang, Shannon Pollard, Karl Weinmeister (1999). PROVERB: The Probabilistic Cruciverbalist. AAAI-1999, pdf
[7] Proverb, the Crossword-Solving Computer Program
[8] Michael L. Littman from Wikipedia
[9] The POMDP Page by Anthony R. Cassandra
[10] dblp: Michael L. Littman


@@ Line 7: / Line 7: @@
 =Crossword Solver=
-<span id="Proverb"></span>During his time at [[Duke University]] in the late 90s, Michael L. Littman worked on an automated [https://en.wikipedia.org/wiki/Crossword crossword] solver '''Proverb''' <ref>[http://www.oneacross.com/proverb/ Proverb: The Probabilistic Cruciverbalist]</ref> <ref>[https://en.wikipedia.org/wiki/Proverb_%28disambiguation%29 Proverb (disambiguation) from Wikipedia]</ref>, which won an Outstanding Paper Award in 1999 from [[AAAI]] <ref>[http://www.aaai.org/Awards/paper.php AAAI Outstanding Paper Award:]</ref> <ref>[https://www.linkedin.com/pub/greg-keim/0/960/648 Greg A. Keim], [http://dblp.uni-trier.de/pers/hd/s/Shazeer:Noam_M= Noam M. Shazeer], [[Michael L. Littman]], [http://dblp.uni-trier.de/pers/hd/a/Agarwal:Sushant Sushant Agarwal], [http://dblp.uni-trier.de/pers/hd/c/Cheves:Catherine_M= Catherine M. Cheves], [http://dblp.uni-trier.de/pers/hd/f/Fitzgerald:Joseph Joseph Fitzgerald], [http://dblp.uni-trier.de/pers/hd/g/Grosland:Jason Jason Grosland], [http://dblp.uni-trier.de/pers/hd/j/Jiang:Fan Fan Jiang], [http://dblp.uni-trier.de/pers/hd/p/Pollard:Shannon Shannon Pollard], [http://dblp.uni-trier.de/pers/hd/w/Weinmeister:Karl Karl Weinmeister] ('''1999'''). ''PROVERB: The Probabilistic Cruciverbalist''. [http://dblp.uni-trier.de/db/conf/aaai/aaai99.html#ShazeerLK99 AAAI/IAAI 1999], [http://www.cs.columbia.edu/~kathy/cs4701/documents/Proverb.pdf pdf]</ref> and competed in the [https://en.wikipedia.org/wiki/American_Crossword_Puzzle_Tournament American Crossword Puzzle Tournament] <ref>[http://www.crosswordtournament.com/1999/art4.htm Proverb, the Crossword-Solving Computer Program]</ref> <ref>[https://en.wikipedia.org/wiki/Michael_L._Littman Michael L. Littman from Wikipedia]</ref>.
+<span id="Proverb"></span>During his time at [[Duke University]] in the late 90s, Michael L. Littman worked on an automated [https://en.wikipedia.org/wiki/Crossword crossword] solver '''Proverb''' <ref>[http://www.oneacross.com/proverb/ Proverb: The Probabilistic Cruciverbalist]</ref> <ref>[https://en.wikipedia.org/wiki/Proverb_%28disambiguation%29 Proverb (disambiguation) from Wikipedia]</ref>, which won an Outstanding Paper Award in 1999 from [[AAAI]] <ref>[http://www.aaai.org/Awards/paper.php AAAI Outstanding Paper Award:]</ref> <ref>[https://www.linkedin.com/pub/greg-keim/0/960/648 Greg A. Keim], [http://dblp.uni-trier.de/pers/hd/s/Shazeer:Noam_M= Noam M. Shazeer], [[Michael L. Littman]], [http://dblp.uni-trier.de/pers/hd/a/Agarwal:Sushant Sushant Agarwal], [http://dblp.uni-trier.de/pers/hd/c/Cheves:Catherine_M= Catherine M. Cheves], [http://dblp.uni-trier.de/pers/hd/f/Fitzgerald:Joseph Joseph Fitzgerald], [http://dblp.uni-trier.de/pers/hd/g/Grosland:Jason Jason Grosland], [http://dblp.uni-trier.de/pers/hd/j/Jiang:Fan Fan Jiang], [http://dblp.uni-trier.de/pers/hd/p/Pollard:Shannon Shannon Pollard], [http://dblp.uni-trier.de/pers/hd/w/Weinmeister:Karl Karl Weinmeister] ('''1999'''). ''PROVERB: The Probabilistic Cruciverbalist''. [[Conferences#AAAI-99|AAAI-1999]], [http://www.cs.columbia.edu/~kathy/cs4701/documents/Proverb.pdf pdf]</ref> and competed in the [https://en.wikipedia.org/wiki/American_Crossword_Puzzle_Tournament American Crossword Puzzle Tournament] <ref>[http://www.crosswordtournament.com/1999/art4.htm Proverb, the Crossword-Solving Computer Program]</ref> <ref>[https://en.wikipedia.org/wiki/Michael_L._Littman Michael L. Littman from Wikipedia]</ref>.
 =See also=
 * [[Matthew L. Ginsberg#DRFill|Dr.Fill]] by [[Matthew L. Ginsberg]]
+* [[Ari Weinstein#FSSS-Minimax|FSSS-Minimax]]
 <span id="MarkovModels"></span>
 =Markov Models=
@@ Line 39: / Line 41: @@
 * [[Michael L. Littman]] ('''1996'''). ''Algorithms for Sequential Decision Making'' . Ph.D. thesis, [https://en.wikipedia.org/wiki/Brown_University Brown University], [http://www.cs.rutgers.edu/~mlittman/papers/thesis-with-gammas.pdf pdf]
 * [[Mathematician#LPKaelbling|Leslie Pack Kaelbling]], [[Michael L. Littman]], [[Mathematician#AWMoore|Andrew W. Moore]] ('''1996'''). ''[http://www.cs.washington.edu/research/jair/volume4/kaelbling96a-html/rl-survey.html Reinforcement Learning: A Survey]''. [http://www.jair.org/vol/vol4.html JAIR, Vol. 4], [http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume4/kaelbling96a.pdf pdf]
-* [[Michael L. Littman]] ('''1997'''). ''Probabilistic Propositional Planning: Representations and Complexity''. [http://dblp.uni-trier.de/db/conf/aaai/aaai97.html#Littman97 AAAI/IAAI 1997]
+* [[Michael L. Littman]] ('''1997'''). ''Probabilistic Propositional Planning: Representations and Complexity''. [[Conferences#AAAI-97|AAAI-1997]]
 * [[Michael L. Littman]], [[Mathematician#JGoldsmith|Judy Goldsmith]], [[Mathematician#MMundhenk|Martin Mundhenk]] ('''1998'''). ''The Computational Complexity of Probabilistic Planning''. [http://dblp.uni-trier.de/db/journals/jair/jair9.html#LittmanGM98 JAIR, Vol. 9], [http://arxiv.org/pdf/cs/9808101.pdf pdf]
-* [[Michael L. Littman]] ('''1999'''). ''Initial Experiments in Stochastic Satisfiability''. [http://dblp.uni-trier.de/db/conf/aaai/aaai99.html#Littman99 AAAI/IAAI 1999]
+* [[Michael L. Littman]] ('''1999'''). ''Initial Experiments in Stochastic Satisfiability''. [[Conferences#AAAI-99|AAAI-1999]]
-* [[Michael L. Littman]], [https://www.linkedin.com/pub/greg-keim/0/960/648 Greg A. Keim], [http://dblp.uni-trier.de/pers/hd/s/Shazeer:Noam_M= Noam M. Shazeer] ('''1999'''). ''Solving Crosswords with PROVERB''. [http://dblp.uni-trier.de/db/conf/aaai/aaai99.html#LittmanKS99 AAAI/IAAI 1999]
+* [[Michael L. Littman]], [https://www.linkedin.com/pub/greg-keim/0/960/648 Greg A. Keim], [http://dblp.uni-trier.de/pers/hd/s/Shazeer:Noam_M= Noam M. Shazeer] ('''1999'''). ''Solving Crosswords with PROVERB''. [[Conferences#AAAI-99|AAAI-1999]]
-* [http://dblp.uni-trier.de/pers/hd/s/Shazeer:Noam_M= Noam M. Shazeer],  [[Michael L. Littman]], [https://www.linkedin.com/pub/greg-keim/0/960/648 Greg A. Keim] ('''1999'''). ''Solving Crossword Puzzles as Probabilistic Constraint Satisfaction''. [http://dblp.uni-trier.de/db/conf/aaai/aaai99.html#ShazeerLK99 AAAI/IAAI 1999], [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.40.9001 CiteSeerX]
+* [http://dblp.uni-trier.de/pers/hd/s/Shazeer:Noam_M= Noam M. Shazeer],  [[Michael L. Littman]], [https://www.linkedin.com/pub/greg-keim/0/960/648 Greg A. Keim] ('''1999'''). ''Solving Crossword Puzzles as Probabilistic Constraint Satisfaction''. [[Conferences#AAAI-99|AAAI-1999]], [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.40.9001 CiteSeerX]
-* [https://www.linkedin.com/pub/greg-keim/0/960/648 Greg A. Keim], [http://dblp.uni-trier.de/pers/hd/s/Shazeer:Noam_M= Noam M. Shazeer], [[Michael L. Littman]], [http://dblp.uni-trier.de/pers/hd/a/Agarwal:Sushant Sushant Agarwal], [http://dblp.uni-trier.de/pers/hd/c/Cheves:Catherine_M= Catherine M. Cheves], [http://dblp.uni-trier.de/pers/hd/f/Fitzgerald:Joseph Joseph Fitzgerald], [http://dblp.uni-trier.de/pers/hd/g/Grosland:Jason Jason Grosland], [http://dblp.uni-trier.de/pers/hd/j/Jiang:Fan Fan Jiang], [http://dblp.uni-trier.de/pers/hd/p/Pollard:Shannon Shannon Pollard], [http://dblp.uni-trier.de/pers/hd/w/Weinmeister:Karl Karl Weinmeister] ('''1999'''). ''PROVERB: The Probabilistic Cruciverbalist''. [http://dblp.uni-trier.de/db/conf/aaai/aaai99.html#ShazeerLK99 AAAI/IAAI 1999], [http://www.cs.columbia.edu/~kathy/cs4701/documents/Proverb.pdf pdf]
+* [https://www.linkedin.com/pub/greg-keim/0/960/648 Greg A. Keim], [http://dblp.uni-trier.de/pers/hd/s/Shazeer:Noam_M= Noam M. Shazeer], [[Michael L. Littman]], [http://dblp.uni-trier.de/pers/hd/a/Agarwal:Sushant Sushant Agarwal], [http://dblp.uni-trier.de/pers/hd/c/Cheves:Catherine_M= Catherine M. Cheves], [http://dblp.uni-trier.de/pers/hd/f/Fitzgerald:Joseph Joseph Fitzgerald], [http://dblp.uni-trier.de/pers/hd/g/Grosland:Jason Jason Grosland], [http://dblp.uni-trier.de/pers/hd/j/Jiang:Fan Fan Jiang], [http://dblp.uni-trier.de/pers/hd/p/Pollard:Shannon Shannon Pollard], [http://dblp.uni-trier.de/pers/hd/w/Weinmeister:Karl Karl Weinmeister] ('''1999'''). ''PROVERB: The Probabilistic Cruciverbalist''. [[Conferences#AAAI-99|AAAI-1999]], [http://www.cs.columbia.edu/~kathy/cs4701/documents/Proverb.pdf pdf]
 ==2000 ...==
 * [[Sebastian Thrun]], [[Michael L. Littman]] ('''2000'''). ''A Review of Reinforcement Learning''. [http://www.informatik.uni-trier.de/~ley/db/journals/aim/aim21.html#ThrunL00 AI Magazine, Vol. 21], No. 1, [http://www.aistudy.com/paper/aaai_journal/AIMag21-01-001.pdf pdf]
@@ Line 57: / Line 59: @@
 * [[Thomas J. Walsh]], [[Sergiu Goschin]], [[Michael L. Littman]] ('''2010'''). ''Integrating sample-based planning and model-based reinforcement learning.'' [[AAAI]], [https://pdfs.semanticscholar.org/bdc9/bfb6ecc6fb5afb684df03d7220c46ebdbf4e.pdf pdf] » [[Monte-Carlo Tree Search|MCTS]], [[UCT]], [[Reinforcement Learning]]
 * [[Michael L. Littman]] ('''2012'''). ''Technical Perspective: A New Way to Search Game Trees''. [[ACM#Communications|Communications of the ACM]], Vol. 55, No. 3
+* [[Ari Weinstein]], [[Michael L. Littman]] ('''2012'''). ''[https://www.aaai.org/ocs/index.php/ICAPS/ICAPS12/paper/view/4697 Bandit-Based Planning and Learning in Continuous-Action Markov Decision Processes]''. [http://dblp.uni-trier.de/db/conf/aips/icaps2012.html ICAPS 2012]
 * [[Mathematician#MKearns|Michael Kearns]], [[Michael L. Littman]], [[Mathematician#SSingh|Satinder Singh]] ('''2013'''). ''Graphical Models for Game Theory''. [http://dblp.uni-trier.de/db/journals/corr/corr1301.html#abs-1301-2281 CoRR, January 2013], [https://www.cis.upenn.edu/~mkearns/papers/graphgames.pdf pdf]
 * [[Anthony R. Cassandra]], [[Michael L. Littman]], [[Nevin L. Zhang]] ('''2013'''). ''Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes''. [http://dblp.uni-trier.de/db/journals/corr/corr1302.html#abs-1302-1525 CoRR, February 2013], [http://arxiv.org/ftp/arxiv/papers/1302/1302.1525.pdf pdf]
 * [[Ari Weinstein]], [[Michael L. Littman]], [[Sergiu Goschin]] ('''2013'''). ''[http://proceedings.mlr.press/v24/weinstein12a.html Rollout-based Game-tree Search Outprunes Traditional Alpha-beta]''. [http://proceedings.mlr.press/ PMLR], Vol. 24 » [[Monte-Carlo Tree Search|MCTS]], [[UCT]]
+* [[Sergiu Goschin]], [[Ari Weinstein]], [[Michael L. Littman]] ('''2013'''). ''The Cross-Entropy Method Optimizes for Quantiles''. [http://dblp.uni-trier.de/db/conf/icml/icml2013.html ICML 2013]
+* [[Ari Weinstein]], [[Michael L. Littman]] ('''2013'''). ''Open-Loop Planning in Large-Scale Stochastic Domains''. [[Conferences#AAAI-2013|AAAI-2013]]
 =External Links=
 * [http://cs.brown.edu/~mlittman/ Michael Littman's Home Page]
 * [https://en.wikipedia.org/wiki/Michael_L._Littman Michael L. Littman from Wikipedia]
-* [https://www.genealogy.ams.org/id.php?id=91913 The Mathematics Genealogy Project - Michael Littman]
+* [https://genealogy.math.ndsu.nodak.edu/id.php?id=91913 The Mathematics Genealogy Project - Michael Littman]
 =References=

Difference between revisions of "Michael L. Littman"
