Difference between revisions of "Dynamic Programming"

From Chessprogramming wiki
(Created page with "'''Home * Programming * Algorithms * Dynamic Programming''' '''Dynamic Programming''', (DP)<br/> a mathematical, algorithmic optimization method of Re...")
 
Line 33: Line 33:
 
* [[Mathematician#SEDreyfus|Stuart E. Dreyfus]], [http://www.averill-law.com/about/ Averill M. Law] ('''1977'''). ''[http://dl.acm.org/citation.cfm?id=578655 Art and Theory of Dynamic Programming]''. [https://en.wikipedia.org/wiki/Academic_Press Academic Press]
 
* [[Mathematician#SEShreve|Steven E. Shreve]], [[Mathematician#DBertsekas|Dimitri Bertsekas]] ('''1979'''). ''[http://pubsonline.informs.org/doi/abs/10.1287/moor.4.1.15?journalCode=moor Universally Measurable Policies in Dynamic Programming]''. [https://en.wikipedia.org/wiki/Mathematics_of_Operations_Research Mathematics of Operations Research], Vol. 4, No. 1
 
+ ==1980 ...==
+ * [[Andrew Barto]], [[Richard Sutton]], [https://dblp.uni-trier.de/pers/hd/w/Watkins:Christopher_J=_C=_H= Christopher J. C. H. Watkins] ('''1989'''). ''[https://papers.nips.cc/paper/194-sequential-decision-problems-and-neural-networks Sequential Decision Problems and Neural Networks]''. [https://dblp.uni-trier.de/db/conf/nips/nips1989.html NIPS 1989]
 
==1990 ...==
 
* [[Mathematician#DBertsekas|Dimitri Bertsekas]] ('''1996, 2017'''). ''[http://www.athenasc.com/dpbook.html Dynamic Programming and Optimal Control]''. [http://www.athenasc.com/index.html Athena Scientific]
 
Line 45: Line 47:
 
==2010 ...==
 
* [[Richard E. Bellman]] ('''2010'''). ''[http://press.princeton.edu/titles/9234.html Dynamic Programming]''. With a new introduction by [[Mathematician#SEDreyfus|Stuart E. Dreyfus]], [https://en.wikipedia.org/wiki/Princeton_University_Press Princeton University Press]
 
+ * [[Andrew Barto]] ('''2010'''). ''Adaptive Real-Time Dynamic Programming''. Encyclopedia of Machine Learning 2010, Springer
 
* [[Rémi Munos]] ('''2010'''). ''Approximate dynamic programming''. In [http://www.isir.upmc.fr/?op=view_profil&id=28&old=N&lang=en Olivier Sigaud] and [http://www.loria.fr/~buffet/ Olivier Buffet], editors, [http://eu.wiley.com/WileyCDA/WileyTitle/productCd-1848211678.html Markov Decision Processes in Artificial Intelligence], chapter 3, ISTE Ltd and [http://eu.wiley.com/WileyCDA/ John Wiley & Sons Inc.], [http://researchers.lille.inria.fr/~munos/papers/files/MDPIA_chap3.pdf pdf]
 
* [http://dblp.uni-trier.de/pers/hd/p/Powell:Warren_B= Warren B. Powell] ('''2011'''). ''[http://adp.princeton.edu/ Approximate Dynamic Programming - Solving the Curses of Dimensionality]''. 2nd edition, [https://en.wikipedia.org/wiki/John_Wiley_%26_Sons Wiley]
 
* [[Mathematician#DBertsekas|Dimitri Bertsekas]] ('''2013'''). ''[http://www.athenasc.com/abstractdp.html Abstract Dynamic Programming]''. [http://www.athenasc.com/index.html Athena Scientific]
 
* [[Richard E. Bellman]], [[Mathematician#SEDreyfus|Stuart E. Dreyfus]] ('''2015'''). ''[http://press.princeton.edu/titles/100.html Applied Dynamic Programming]''. [https://en.wikipedia.org/wiki/Princeton_University_Press Princeton University Press]
 
+ * [[Andrew Barto]] ('''2017'''). ''Adaptive Real-Time Dynamic Programming''. [https://link.springer.com/referencework/10.1007%2F978-1-4899-7687-1 Encyclopedia of Machine Learning and Data Mining 2017], [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer]
  
 
=External Links=
 
Revision as of 09:51, 9 June 2018

Dynamic Programming (DP),
a mathematical and algorithmic optimization method that solves a larger decision problem by recursively breaking it into overlapping subproblems with optimal substructure. Richard E. Bellman coined the term in the 1950s, using "programming" not in the sense of producing computer code but in the sense of mathematical programming, that is, planning or optimization, as in linear programming. The method is devoted to the study of multistage processes, which are composed of sequences of operations in which the outcome of those preceding may be used to guide the course of future ones [1].
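
As a minimal sketch of overlapping subproblems with optimal substructure, consider a small multistage decision problem: finding the cheapest path through a grid when only moves to the right or downward are allowed. The grid COST and the function min_cost are hypothetical names chosen for this illustration, and functools.lru_cache stands in for a hand-written table of solved subproblems.

```python
from functools import lru_cache

# Hypothetical stage costs (an assumption for this sketch):
# COST[i][j] is the cost of entering cell (i, j).
COST = (
    (1, 3, 1),
    (1, 5, 1),
    (4, 2, 1),
)

@lru_cache(maxsize=None)  # each subproblem (i, j) is solved only once
def min_cost(i, j):
    """Cheapest path cost from (0, 0) to (i, j), moving only right or down."""
    if i == 0 and j == 0:
        return COST[0][0]
    candidates = []
    if i > 0:
        candidates.append(min_cost(i - 1, j))  # arrive from above
    if j > 0:
        candidates.append(min_cost(i, j - 1))  # arrive from the left
    # Optimal substructure: the best path to (i, j) extends the best path
    # to one of its predecessors.
    return COST[i][j] + min(candidates)

print(min_cost(2, 2))
```

The cell (1, 1) is a predecessor of both (1, 2) and (2, 1), so without the cache its subproblem would be solved repeatedly; with the cache, each cell is evaluated once.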

DP in Computer Chess

In computer chess, dynamic programming is applied in depth-first search with memoization, that is, with a transposition table and/or other hash tables. The search traverses a tree of overlapping subproblems (the child positions reached after one side makes a move) in a top-down manner, and it gains from stored positions of sibling subtrees due to transpositions and/or common aspects of positions. This is particularly effective inside an iterative deepening framework. Another dynamic programming approach in computer chess and computer games is retrograde analysis, which solves a problem by solving subproblems in a bottom-up manner, starting from terminal nodes [2].
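
The two approaches can be contrasted on a toy game, sketched here under stated assumptions: a pile of n stones, each move removes 1, 2 or 3 stones, and the player who cannot move loses. The game, the dict used as a transposition table, and the names negamax and retrograde are illustrative choices, not taken from any chess engine. A real engine would key its table by a position hash and also store depth and bound information.

```python
# Toy game (an assumption for illustration): a pile of n stones, each move
# removes 1, 2 or 3 stones, and the player who cannot move (n == 0) loses.
MOVES = (1, 2, 3)

def negamax(n, table):
    """Top-down: depth-first search with a transposition table (memoization).
    Returns +1 if the side to move wins, -1 if it loses."""
    if n in table:   # position already searched via another move order
        return table[n]
    if n == 0:       # terminal node: the side to move has no legal move
        return -1
    best = max(-negamax(n - m, table) for m in MOVES if m <= n)
    table[n] = best  # store the result for sibling subtrees
    return best

def retrograde(max_n):
    """Bottom-up: start from the terminal position and work backwards."""
    value = [-1] * (max_n + 1)  # value[0] = -1 is the terminal loss
    for n in range(1, max_n + 1):
        # All successors n - m are smaller, so they are already solved.
        value[n] = max(-value[n - m] for m in MOVES if m <= n)
    return value

table = {}
print(negamax(21, table))  # top-down value of the 21-stone position
print(retrograde(8))       # bottom-up values for piles of 0..8 stones
```

Both functions compute the same values. The top-down search visits only positions reachable from the root and reuses them when different move orders transpose into the same pile size, while the bottom-up pass fills in every position up to max_n regardless of whether a particular root needs it.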

See also

Selected Publications

1953 ...

1960 ...

1970 ...

1980 ...

1990 ...

2000 ...

2010 ...

External Links

Algorithms that use dynamic programming

References
