Difference between revisions of "Michael L. Littman"
Revision as of 19:09, 8 June 2018
Michael Lederman Littman,
an American mathematician, computer scientist and professor of computer science at Brown University, previously at Rutgers University and Duke University. He received his Ph.D. from Brown University in 1996 with the thesis Algorithms for Sequential Decision Making, under advisor Leslie P. Kaelbling [2]. His research interests focus on stochastic games and reinforcement learning, along with the related Markov models for decision making: the Markov Chain, the Hidden Markov Model (HMM), the Markov Decision Process (MDP), and the Partially Observable Markov Decision Process (POMDP).
Crossword Solver
During his time at Duke University in the late 1990s, Michael L. Littman worked on the automated crossword solver Proverb [3] [4], which won an Outstanding Paper Award from AAAI in 1999 [5] [6] and competed in the American Crossword Puzzle Tournament [7] [8].
See also
- Dr.Fill by Matthew L. Ginsberg
- FSSS-Minimax
Markov Models
Michael Littman's explanatory grid on Markov Models [9] :
| Markov Models | Do we have control over the state transitions? NO | Do we have control over the state transitions? YES |
| Are the states completely observable? YES | Markov Chain | MDP |
| Are the states completely observable? NO | HMM | POMDP |
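The grid amounts to a lookup on two yes/no questions. As a minimal sketch, assuming nothing beyond the four cells above, it could be written as a small Python table; the names `GRID` and `markov_model` are hypothetical and chosen only for this illustration.

```python
# Illustrative sketch of the explanatory grid; GRID and markov_model are
# hypothetical names, not part of any library or of the original page.

# Key: (control over state transitions?, states completely observable?)
GRID = {
    (False, True): "Markov Chain",
    (True, True): "MDP",
    (False, False): "HMM",
    (True, False): "POMDP",
}


def markov_model(controllable: bool, fully_observable: bool) -> str:
    """Return the model family named in the grid for the two answers."""
    return GRID[(controllable, fully_observable)]


if __name__ == "__main__":
    # An agent that chooses actions but cannot see the full state.
    print(markov_model(controllable=True, fully_observable=False))
```

Keying the table on a pair of booleans keeps the code in one-to-one correspondence with the four cells of the grid.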
Selected Publications
1990 ...
- Justin A. Boyan, Michael L. Littman (1993). Packet Routing in Dynamically Changing Networks: A Reinforcement Learning Approach. NIPS 1993, pdf
- Michael L. Littman (1994). Markov Games as a Framework for Multi-Agent Reinforcement Learning. International Conference on Machine Learning, pdf
- Michael L. Littman (1996). Algorithms for Sequential Decision Making. Ph.D. thesis, Brown University, pdf
- Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore (1996). Reinforcement Learning: A Survey. JAIR, Vol. 4, pdf
- Michael L. Littman (1997). Probabilistic Propositional Planning: Representations and Complexity. AAAI-1997
- Michael L. Littman, Judy Goldsmith, Martin Mundhenk (1998). The Computational Complexity of Probabilistic Planning. JAIR, Vol. 9, pdf
- Michael L. Littman (1999). Initial Experiments in Stochastic Satisfiability. AAAI-1999
- Michael L. Littman, Greg A. Keim, Noam M. Shazeer (1999). Solving Crosswords with PROVERB. AAAI-1999
- Noam M. Shazeer, Michael L. Littman, Greg A. Keim (1999). Solving Crossword Puzzles as Probabilistic Constraint Satisfaction. AAAI-1999, CiteSeerX
- Greg A. Keim, Noam M. Shazeer, Michael L. Littman, Sushant Agarwal, Catherine M. Cheves, Joseph Fitzgerald, Jason Grosland, Fan Jiang, Shannon Pollard, Karl Weinmeister (1999). PROVERB: The Probabilistic Cruciverbalist. AAAI-1999, pdf
2000 ...
- Sebastian Thrun, Michael L. Littman (2000). A Review of Reinforcement Learning. AI Magazine, Vol. 21, No. 1, pdf
- Michael L. Littman (2000). Review: Computer Language Games. CG 2000
- Jiefu Shi, Michael L. Littman (2000). Abstraction Methods for Game Theoretic Poker. CG 2000
- Michael L. Littman, Richard Sutton, Satinder Singh (2001). Predictive Representations of State. NIPS 2001, pdf
- Michael L. Littman (2001). Value-function reinforcement learning in Markov games. Cognitive Systems Research, Vol. 2, pdf
- Michael L. Littman, Greg A. Keim, Noam M. Shazeer (2002). A probabilistic approach to solving crossword puzzles. Artificial Intelligence, Vol. 134, pdf
- Peter Stone, Robert Schapire, Michael L. Littman, János A. Csirik, David McAllester (2003). Decision-Theoretic Bidding Based on Learned Density Models in Simultaneous, Interacting Auctions. JAIR, Vol. 19, pdf
- Michael L. Littman (2008). Autonomous Model Learning for Reinforcement Learning. QEST 2008
2010 ...
- Thomas J. Walsh, Sergiu Goschin, Michael L. Littman (2010). Integrating sample-based planning and model-based reinforcement learning. AAAI, pdf » MCTS, UCT, Reinforcement Learning
- Michael L. Littman (2012). Technical Perspective: A New Way to Search Game Trees. Communications of the ACM, Vol. 55, No. 3
- Ari Weinstein, Michael L. Littman (2012). Bandit-Based Planning and Learning in Continuous-Action Markov Decision Processes. ICAPS 2012
- Michael Kearns, Michael L. Littman, Satinder Singh (2013). Graphical Models for Game Theory. CoRR, January 2013, pdf
- Anthony R. Cassandra, Michael L. Littman, Nevin L. Zhang (2013). Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes. CoRR, February 2013, pdf
- Ari Weinstein, Michael L. Littman, Sergiu Goschin (2013). Rollout-based Game-tree Search Outprunes Traditional Alpha-beta. PMLR, Vol. 24 » MCTS, UCT
- Sergiu Goschin, Ari Weinstein, Michael L. Littman (2013). The Cross-Entropy Method Optimizes for Quantiles. ICML 2013
- Ari Weinstein, Michael L. Littman (2013). Open-Loop Planning in Large-Scale Stochastic Domains. AAAI-2013
External Links
- Michael Littman's Home Page
- Michael L. Littman from Wikipedia
- The Mathematics Genealogy Project - Michael Littman
References
- [1] Michael Littman at Rutgers University in 2009. This picture was taken by a student during a lecture, Michael L. Littman from Wikipedia
- [2] Michael L. Littman (1996). Algorithms for Sequential Decision Making. Ph.D. thesis, Brown University, pdf
- [3] Proverb: The Probabilistic Cruciverbalist
- [4] Proverb (disambiguation) from Wikipedia
- [5] AAAI Outstanding Paper Award
- [6] Greg A. Keim, Noam M. Shazeer, Michael L. Littman, Sushant Agarwal, Catherine M. Cheves, Joseph Fitzgerald, Jason Grosland, Fan Jiang, Shannon Pollard, Karl Weinmeister (1999). PROVERB: The Probabilistic Cruciverbalist. AAAI-1999, pdf
- [7] Proverb, the Crossword-Solving Computer Program
- [8] Michael L. Littman from Wikipedia
- [9] The POMDP Page by Anthony R. Cassandra
- [10] dblp: Michael L. Littman