Difference between revisions of "Michael L. Littman"

From Chessprogramming wiki
(Created page with "'''Home * People * Michael L. Littman''' FILE:Michael Littman at Rutgers 2009.jpg|border|right|thumb|link=https://en.wikipedia.org/wiki/Michael_L._Littman...")
 
Line 7: Line 7:
  
 
=Crossword Solver=
 
=Crossword Solver=
<span id="Proverb"></span>During his time at [[Duke University]] in the late 90s, Michael L. Littman worked on an automated [https://en.wikipedia.org/wiki/Crossword crossword] solver '''Proverb''' <ref>[http://www.oneacross.com/proverb/ Proverb: The Probabilistic Cruciverbalist]</ref> <ref>[https://en.wikipedia.org/wiki/Proverb_%28disambiguation%29 Proverb (disambiguation) from Wikipedia]</ref>, which won an Outstanding Paper Award in 1999 from [[AAAI]] <ref>[http://www.aaai.org/Awards/paper.php AAAI Outstanding Paper Award:]</ref> <ref>[https://www.linkedin.com/pub/greg-keim/0/960/648 Greg A. Keim], [http://dblp.uni-trier.de/pers/hd/s/Shazeer:Noam_M= Noam M. Shazeer], [[Michael L. Littman]], [http://dblp.uni-trier.de/pers/hd/a/Agarwal:Sushant Sushant Agarwal], [http://dblp.uni-trier.de/pers/hd/c/Cheves:Catherine_M= Catherine M. Cheves], [http://dblp.uni-trier.de/pers/hd/f/Fitzgerald:Joseph Joseph Fitzgerald], [http://dblp.uni-trier.de/pers/hd/g/Grosland:Jason Jason Grosland], [http://dblp.uni-trier.de/pers/hd/j/Jiang:Fan Fan Jiang], [http://dblp.uni-trier.de/pers/hd/p/Pollard:Shannon Shannon Pollard], [http://dblp.uni-trier.de/pers/hd/w/Weinmeister:Karl Karl Weinmeister] ('''1999'''). ''PROVERB: The Probabilistic Cruciverbalist''. [http://dblp.uni-trier.de/db/conf/aaai/aaai99.html#ShazeerLK99 AAAI/IAAI 1999], [http://www.cs.columbia.edu/~kathy/cs4701/documents/Proverb.pdf pdf]</ref> and competed in the [https://en.wikipedia.org/wiki/American_Crossword_Puzzle_Tournament American Crossword Puzzle Tournament] <ref>[http://www.crosswordtournament.com/1999/art4.htm Proverb, the Crossword-Solving Computer Program]</ref> <ref>[https://en.wikipedia.org/wiki/Michael_L._Littman Michael L. Littman from Wikipedia]</ref>.  
+
<span id="Proverb"></span>During his time at [[Duke University]] in the late 90s, Michael L. Littman worked on an automated [https://en.wikipedia.org/wiki/Crossword crossword] solver '''Proverb''' <ref>[http://www.oneacross.com/proverb/ Proverb: The Probabilistic Cruciverbalist]</ref> <ref>[https://en.wikipedia.org/wiki/Proverb_%28disambiguation%29 Proverb (disambiguation) from Wikipedia]</ref>, which won an Outstanding Paper Award in 1999 from [[AAAI]] <ref>[http://www.aaai.org/Awards/paper.php AAAI Outstanding Paper Award:]</ref> <ref>[https://www.linkedin.com/pub/greg-keim/0/960/648 Greg A. Keim], [http://dblp.uni-trier.de/pers/hd/s/Shazeer:Noam_M= Noam M. Shazeer], [[Michael L. Littman]], [http://dblp.uni-trier.de/pers/hd/a/Agarwal:Sushant Sushant Agarwal], [http://dblp.uni-trier.de/pers/hd/c/Cheves:Catherine_M= Catherine M. Cheves], [http://dblp.uni-trier.de/pers/hd/f/Fitzgerald:Joseph Joseph Fitzgerald], [http://dblp.uni-trier.de/pers/hd/g/Grosland:Jason Jason Grosland], [http://dblp.uni-trier.de/pers/hd/j/Jiang:Fan Fan Jiang], [http://dblp.uni-trier.de/pers/hd/p/Pollard:Shannon Shannon Pollard], [http://dblp.uni-trier.de/pers/hd/w/Weinmeister:Karl Karl Weinmeister] ('''1999'''). ''PROVERB: The Probabilistic Cruciverbalist''. [[Conferences#AAAI-99|AAAI-1999]], [http://www.cs.columbia.edu/~kathy/cs4701/documents/Proverb.pdf pdf]</ref> and competed in the [https://en.wikipedia.org/wiki/American_Crossword_Puzzle_Tournament American Crossword Puzzle Tournament] <ref>[http://www.crosswordtournament.com/1999/art4.htm Proverb, the Crossword-Solving Computer Program]</ref> <ref>[https://en.wikipedia.org/wiki/Michael_L._Littman Michael L. Littman from Wikipedia]</ref>.  
  
 
=See also=
 
=See also=
 
* [[Matthew L. Ginsberg#DRFill|Dr.Fill]] by [[Matthew L. Ginsberg]]
 
* [[Matthew L. Ginsberg#DRFill|Dr.Fill]] by [[Matthew L. Ginsberg]]
 +
* [[Ari Weinstein#FSSS-Minimax|FSSS-Minimax]]
 +
 
<span id="MarkovModels"></span>
 
<span id="MarkovModels"></span>
 
=Markov Models=  
 
=Markov Models=  
Line 39: Line 41:
 
* [[Michael L. Littman]] ('''1996'''). ''Algorithms for Sequential Decision Making'' . Ph.D. thesis, [https://en.wikipedia.org/wiki/Brown_University Brown University], [http://www.cs.rutgers.edu/~mlittman/papers/thesis-with-gammas.pdf pdf]
 
* [[Michael L. Littman]] ('''1996'''). ''Algorithms for Sequential Decision Making'' . Ph.D. thesis, [https://en.wikipedia.org/wiki/Brown_University Brown University], [http://www.cs.rutgers.edu/~mlittman/papers/thesis-with-gammas.pdf pdf]
 
* [[Mathematician#LPKaelbling|Leslie Pack Kaelbling]], [[Michael L. Littman]], [[Mathematician#AWMoore|Andrew W. Moore]] ('''1996'''). ''[http://www.cs.washington.edu/research/jair/volume4/kaelbling96a-html/rl-survey.html Reinforcement Learning: A Survey]''. [http://www.jair.org/vol/vol4.html JAIR, Vol. 4], [http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume4/kaelbling96a.pdf pdf]
 
* [[Mathematician#LPKaelbling|Leslie Pack Kaelbling]], [[Michael L. Littman]], [[Mathematician#AWMoore|Andrew W. Moore]] ('''1996'''). ''[http://www.cs.washington.edu/research/jair/volume4/kaelbling96a-html/rl-survey.html Reinforcement Learning: A Survey]''. [http://www.jair.org/vol/vol4.html JAIR, Vol. 4], [http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume4/kaelbling96a.pdf pdf]
* [[Michael L. Littman]] ('''1997'''). ''Probabilistic Propositional Planning: Representations and Complexity''. [http://dblp.uni-trier.de/db/conf/aaai/aaai97.html#Littman97 AAAI/IAAI 1997]
+
* [[Michael L. Littman]] ('''1997'''). ''Probabilistic Propositional Planning: Representations and Complexity''. [[Conferences#AAAI-97|AAAI-1997]]
 
* [[Michael L. Littman]], [[Mathematician#JGoldsmith|Judy Goldsmith]], [[Mathematician#MMundhenk|Martin Mundhenk]] ('''1998'''). ''The Computational Complexity of Probabilistic Planning''. [http://dblp.uni-trier.de/db/journals/jair/jair9.html#LittmanGM98 JAIR, Vol. 9], [http://arxiv.org/pdf/cs/9808101.pdf pdf]
 
* [[Michael L. Littman]], [[Mathematician#JGoldsmith|Judy Goldsmith]], [[Mathematician#MMundhenk|Martin Mundhenk]] ('''1998'''). ''The Computational Complexity of Probabilistic Planning''. [http://dblp.uni-trier.de/db/journals/jair/jair9.html#LittmanGM98 JAIR, Vol. 9], [http://arxiv.org/pdf/cs/9808101.pdf pdf]
* [[Michael L. Littman]] ('''1999'''). ''Initial Experiments in Stochastic Satisfiability''. [http://dblp.uni-trier.de/db/conf/aaai/aaai99.html#Littman99 AAAI/IAAI 1999]
+
* [[Michael L. Littman]] ('''1999'''). ''Initial Experiments in Stochastic Satisfiability''. [[Conferences#AAAI-99|AAAI-1999]]
* [[Michael L. Littman]], [https://www.linkedin.com/pub/greg-keim/0/960/648 Greg A. Keim], [http://dblp.uni-trier.de/pers/hd/s/Shazeer:Noam_M= Noam M. Shazeer] ('''1999'''). ''Solving Crosswords with PROVERB''. [http://dblp.uni-trier.de/db/conf/aaai/aaai99.html#LittmanKS99 AAAI/IAAI 1999]
+
* [[Michael L. Littman]], [https://www.linkedin.com/pub/greg-keim/0/960/648 Greg A. Keim], [http://dblp.uni-trier.de/pers/hd/s/Shazeer:Noam_M= Noam M. Shazeer] ('''1999'''). ''Solving Crosswords with PROVERB''. [[Conferences#AAAI-99|AAAI-1999]]
* [http://dblp.uni-trier.de/pers/hd/s/Shazeer:Noam_M= Noam M. Shazeer],  [[Michael L. Littman]], [https://www.linkedin.com/pub/greg-keim/0/960/648 Greg A. Keim] ('''1999'''). ''Solving Crossword Puzzles as Probabilistic Constraint Satisfaction''. [http://dblp.uni-trier.de/db/conf/aaai/aaai99.html#ShazeerLK99 AAAI/IAAI 1999], [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.40.9001 CiteSeerX]
+
* [http://dblp.uni-trier.de/pers/hd/s/Shazeer:Noam_M= Noam M. Shazeer],  [[Michael L. Littman]], [https://www.linkedin.com/pub/greg-keim/0/960/648 Greg A. Keim] ('''1999'''). ''Solving Crossword Puzzles as Probabilistic Constraint Satisfaction''. [[Conferences#AAAI-99|AAAI-1999]], [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.40.9001 CiteSeerX]
* [https://www.linkedin.com/pub/greg-keim/0/960/648 Greg A. Keim], [http://dblp.uni-trier.de/pers/hd/s/Shazeer:Noam_M= Noam M. Shazeer], [[Michael L. Littman]], [http://dblp.uni-trier.de/pers/hd/a/Agarwal:Sushant Sushant Agarwal], [http://dblp.uni-trier.de/pers/hd/c/Cheves:Catherine_M= Catherine M. Cheves], [http://dblp.uni-trier.de/pers/hd/f/Fitzgerald:Joseph Joseph Fitzgerald], [http://dblp.uni-trier.de/pers/hd/g/Grosland:Jason Jason Grosland], [http://dblp.uni-trier.de/pers/hd/j/Jiang:Fan Fan Jiang], [http://dblp.uni-trier.de/pers/hd/p/Pollard:Shannon Shannon Pollard], [http://dblp.uni-trier.de/pers/hd/w/Weinmeister:Karl Karl Weinmeister] ('''1999'''). ''PROVERB: The Probabilistic Cruciverbalist''. [http://dblp.uni-trier.de/db/conf/aaai/aaai99.html#ShazeerLK99 AAAI/IAAI 1999], [http://www.cs.columbia.edu/~kathy/cs4701/documents/Proverb.pdf pdf]
+
* [https://www.linkedin.com/pub/greg-keim/0/960/648 Greg A. Keim], [http://dblp.uni-trier.de/pers/hd/s/Shazeer:Noam_M= Noam M. Shazeer], [[Michael L. Littman]], [http://dblp.uni-trier.de/pers/hd/a/Agarwal:Sushant Sushant Agarwal], [http://dblp.uni-trier.de/pers/hd/c/Cheves:Catherine_M= Catherine M. Cheves], [http://dblp.uni-trier.de/pers/hd/f/Fitzgerald:Joseph Joseph Fitzgerald], [http://dblp.uni-trier.de/pers/hd/g/Grosland:Jason Jason Grosland], [http://dblp.uni-trier.de/pers/hd/j/Jiang:Fan Fan Jiang], [http://dblp.uni-trier.de/pers/hd/p/Pollard:Shannon Shannon Pollard], [http://dblp.uni-trier.de/pers/hd/w/Weinmeister:Karl Karl Weinmeister] ('''1999'''). ''PROVERB: The Probabilistic Cruciverbalist''. [[Conferences#AAAI-99|AAAI-1999]], [http://www.cs.columbia.edu/~kathy/cs4701/documents/Proverb.pdf pdf]
 
==2000 ...==  
 
==2000 ...==  
 
* [[Sebastian Thrun]], [[Michael L. Littman]] ('''2000'''). ''A Review of Reinforcement Learning''. [http://www.informatik.uni-trier.de/~ley/db/journals/aim/aim21.html#ThrunL00 AI Magazine, Vol. 21], No. 1, [http://www.aistudy.com/paper/aaai_journal/AIMag21-01-001.pdf pdf]
 
* [[Sebastian Thrun]], [[Michael L. Littman]] ('''2000'''). ''A Review of Reinforcement Learning''. [http://www.informatik.uni-trier.de/~ley/db/journals/aim/aim21.html#ThrunL00 AI Magazine, Vol. 21], No. 1, [http://www.aistudy.com/paper/aaai_journal/AIMag21-01-001.pdf pdf]
Line 57: Line 59:
 
* [[Thomas J. Walsh]], [[Sergiu Goschin]], [[Michael L. Littman]] ('''2010'''). ''Integrating sample-based planning and model-based reinforcement learning.'' [[AAAI]], [https://pdfs.semanticscholar.org/bdc9/bfb6ecc6fb5afb684df03d7220c46ebdbf4e.pdf pdf] » [[Monte-Carlo Tree Search|MCTS]], [[UCT]], [[Reinforcement Learning]]
 
* [[Thomas J. Walsh]], [[Sergiu Goschin]], [[Michael L. Littman]] ('''2010'''). ''Integrating sample-based planning and model-based reinforcement learning.'' [[AAAI]], [https://pdfs.semanticscholar.org/bdc9/bfb6ecc6fb5afb684df03d7220c46ebdbf4e.pdf pdf] » [[Monte-Carlo Tree Search|MCTS]], [[UCT]], [[Reinforcement Learning]]
 
* [[Michael L. Littman]] ('''2012'''). ''Technical Perspective: A New Way to Search Game Trees''. [[ACM#Communications|Communications of the ACM]], Vol. 55, No. 3
 
* [[Michael L. Littman]] ('''2012'''). ''Technical Perspective: A New Way to Search Game Trees''. [[ACM#Communications|Communications of the ACM]], Vol. 55, No. 3
 +
* [[Ari Weinstein]], [[Michael L. Littman]] ('''2012'''). ''[https://www.aaai.org/ocs/index.php/ICAPS/ICAPS12/paper/view/4697 Bandit-Based Planning and Learning in Continuous-Action Markov Decision Processes]''. [http://dblp.uni-trier.de/db/conf/aips/icaps2012.html ICAPS 2012]
 
* [[Mathematician#MKearns|Michael Kearns]], [[Michael L. Littman]], [[Mathematician#SSingh|Satinder Singh]] ('''2013'''). ''Graphical Models for Game Theory''. [http://dblp.uni-trier.de/db/journals/corr/corr1301.html#abs-1301-2281 CoRR, January 2013], [https://www.cis.upenn.edu/~mkearns/papers/graphgames.pdf pdf]
 
* [[Mathematician#MKearns|Michael Kearns]], [[Michael L. Littman]], [[Mathematician#SSingh|Satinder Singh]] ('''2013'''). ''Graphical Models for Game Theory''. [http://dblp.uni-trier.de/db/journals/corr/corr1301.html#abs-1301-2281 CoRR, January 2013], [https://www.cis.upenn.edu/~mkearns/papers/graphgames.pdf pdf]
 
* [[Anthony R. Cassandra]], [[Michael L. Littman]], [[Nevin L. Zhang]] ('''2013'''). ''Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes''. [http://dblp.uni-trier.de/db/journals/corr/corr1302.html#abs-1302-1525 CoRR, February 2013], [http://arxiv.org/ftp/arxiv/papers/1302/1302.1525.pdf pdf]
 
* [[Anthony R. Cassandra]], [[Michael L. Littman]], [[Nevin L. Zhang]] ('''2013'''). ''Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes''. [http://dblp.uni-trier.de/db/journals/corr/corr1302.html#abs-1302-1525 CoRR, February 2013], [http://arxiv.org/ftp/arxiv/papers/1302/1302.1525.pdf pdf]
 
* [[Ari Weinstein]], [[Michael L. Littman]], [[Sergiu Goschin]] ('''2013'''). ''[http://proceedings.mlr.press/v24/weinstein12a.html Rollout-based Game-tree Search Outprunes Traditional Alpha-beta]''. [http://proceedings.mlr.press/ PMLR], Vol. 24 » [[Monte-Carlo Tree Search|MCTS]], [[UCT]]
 
* [[Ari Weinstein]], [[Michael L. Littman]], [[Sergiu Goschin]] ('''2013'''). ''[http://proceedings.mlr.press/v24/weinstein12a.html Rollout-based Game-tree Search Outprunes Traditional Alpha-beta]''. [http://proceedings.mlr.press/ PMLR], Vol. 24 » [[Monte-Carlo Tree Search|MCTS]], [[UCT]]
 +
* [[Sergiu Goschin]], [[Ari Weinstein]], [[Michael L. Littman]] ('''2013'''). ''The Cross-Entropy Method Optimizes for Quantiles''. [http://dblp.uni-trier.de/db/conf/icml/icml2013.html ICML 2013]
 +
* [[Ari Weinstein]], [[Michael L. Littman]] ('''2013'''). ''Open-Loop Planning in Large-Scale Stochastic Domains''. [[Conferences#AAAI-2013|AAAI-2013]]
  
 
=External Links=  
 
=External Links=  
 
* [http://cs.brown.edu/~mlittman/ Michael Littman's Home Page]
 
* [http://cs.brown.edu/~mlittman/ Michael Littman's Home Page]
 
* [https://en.wikipedia.org/wiki/Michael_L._Littman Michael L. Littman from Wikipedia]
 
* [https://en.wikipedia.org/wiki/Michael_L._Littman Michael L. Littman from Wikipedia]
* [https://www.genealogy.ams.org/id.php?id=91913 The Mathematics Genealogy Project - Michael Littman]
+
* [https://genealogy.math.ndsu.nodak.edu/id.php?id=91913 The Mathematics Genealogy Project - Michael Littman]
  
 
=References=  
 
=References=  

Revision as of 19:09, 8 June 2018


Michael L. Littman [1]

Michael Lederman Littman is an American mathematician, computer scientist, and professor of computer science at Brown University, previously at Rutgers University and Duke University. He received his Ph.D. from Brown University in 1996 with the thesis Algorithms for Sequential Decision Making, advised by Leslie P. Kaelbling [2]. His research focuses on stochastic games and reinforcement learning, along with the related Markov models for decision making: the Markov chain, hidden Markov model (HMM), Markov decision process (MDP), and partially observable Markov decision process (POMDP).

Crossword Solver

During his time at Duke University in the late 1990s, Michael L. Littman worked on the automated crossword solver Proverb [3] [4], which won an Outstanding Paper Award from AAAI in 1999 [5] [6] and competed in the American Crossword Puzzle Tournament [7] [8].

See also

Markov Models

Michael Littman's explanatory grid on Markov Models [9]:

Markov Models                              Do we have control over the state transitions?
                                           NO              YES
Are the states completely observable? YES  Markov Chain    MDP
                                      NO   HMM             POMDP
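
The grid above can be read as a two-question lookup. The following is a minimal sketch of that reading, not code from Littman or from this wiki; the function name `markov_model` and its two boolean parameters are hypothetical names chosen for illustration.

```python
def markov_model(controllable: bool, observable: bool) -> str:
    """Return the model family from the 2x2 grid.

    controllable: do we have control over the state transitions?
    observable:   are the states completely observable?
    Both parameter names are assumptions made for this sketch.
    """
    grid = {
        (False, True): "Markov Chain",   # no control, fully observable
        (True, True): "MDP",             # control, fully observable
        (False, False): "HMM",           # no control, partially observable
        (True, False): "POMDP",          # control, partially observable
    }
    return grid[(controllable, observable)]
```

For example, a setting where an agent chooses actions but sees the state only indirectly maps to `markov_model(True, False)`, the POMDP cell of the grid.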

Selected Publications

[10]

1990 ...

2000 ...

2010 ...

External Links

References
