Changes

Jump to: navigation, search

Jonathan Baxter

164 bytes added, 21:40, 23 May 2019
no edit summary
==2000 ...==
* [[Jonathan Baxter]], [[Andrew Tridgell]], [[Lex Weaver]] ('''2000'''). ''Learning to Play Chess Using Temporal Differences''. [http://www.dblp.org/db/journals/ml/ml40.html#BaxterTW00 Machine Learning, Vol 40, No. 3], [http://www.cs.princeton.edu/courses/archive/fall06/cos402/papers/chess-RL.pdf pdf]
* [[Jonathan Baxter]], [[Mathematician#PBartlett|Peter Bartlett]] ('''2000'''). ''Reinforcement Learning on POMDPs via Direct Gradient Ascent''. [http://dblp.uni-trier.de/db/conf/icml/icml2000.html ICML 2000], [https://pdfs.semanticscholar.org/b874/98f0879d312c308889135203b17069aa0486.pdf pdf]* [[Jonathan Baxter]], [[Mathematician#PBartlett|Peter Bartlett]] ('''2001'''). ''Infinite-Horizon Policy-Gradient Estimation''. [https://en.wikipedia.org/wiki/Journal_of_Artificial_Intelligence_Research Journal of Artificial Intelligence Research] 15, [http://neuro.bstu.by/ai/To-dom/My_research/Papers-2.0/STDP/Reinforcement-L/A/Refs/baxter01a.pdf pdf]
* [[Lex Weaver]], [[Jonathan Baxter]] ('''2001'''). ''STD (λ): learning state differences with TD (λ)''. [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.20.7737 CiteSeerX]
* [https://www.linkedin.com/in/dougaberdeen/ Douglas Aberdeen], [[Jonathan Baxter]] ('''2001'''). ''[https://onlinelibrary.wiley.com/doi/abs/10.1002/cpe.549 Emmerald: a fast matrix-matrix multiply using Intel's SSE instructions]''. [https://onlinelibrary.wiley.com/journal/15320634 Concurrency and Computation: Practice and Experience], Vol. 13, No. 2
==2010 ...==
* [[Jonathan Baxter]] ('''2011'''). ''A Model of Inductive Bias Learning''. [https://arxiv.org/abs/1106.0245 arXiv:1106.0245] <ref>[https://en.wikipedia.org/wiki/Inductive_bias Inductive bias from Wikipedia]</ref>
* [[Jonathan Baxter]], [[Mathematician#PBartlett|Peter Bartlett]] ('''2011'''). ''Infinite-Horizon Policy-Gradient Estimation''. [https://arxiv.org/abs/1106.0665 arXiv:1106.0665]* [[Mathematician#PBartlett|Peter Bartlett]], [[Jonathan Baxter]], [[Lex Weaver]] ('''2011'''). ''Experiments with Infinite-Horizon, Policy-Gradient Estimation''. [https://arxiv.org/abs/1106.0666 arXiv:1106.0666]
=Forum Posts=
'''[[People|Up one level]]'''
[[Category:Researcher|Baxter]]
[[Category:Chess Programmer|Baxter]]

Navigation menu