Difference between revisions of "Jonathan Baxter"
GerdIsenberg (talk | contribs) (Created page with "'''Home * People * Jonathan Baxter''' FILE:JonathanBaxter.jpg|border|right|thumb|link=https://www.linkedin.com/in/jonathan-baxter-a612/|Jonathan Baxter <r...") |
GerdIsenberg (talk | contribs) |
||
(3 intermediate revisions by the same user not shown) | |||
Line 19: | Line 19: | ||
==2000 ...== | ==2000 ...== | ||
* [[Jonathan Baxter]], [[Andrew Tridgell]], [[Lex Weaver]] ('''2000'''). ''Learning to Play Chess Using Temporal Differences''. [http://www.dblp.org/db/journals/ml/ml40.html#BaxterTW00 Machine Learning, Vol 40, No. 3], [http://www.cs.princeton.edu/courses/archive/fall06/cos402/papers/chess-RL.pdf pdf] | * [[Jonathan Baxter]], [[Andrew Tridgell]], [[Lex Weaver]] ('''2000'''). ''Learning to Play Chess Using Temporal Differences''. [http://www.dblp.org/db/journals/ml/ml40.html#BaxterTW00 Machine Learning, Vol 40, No. 3], [http://www.cs.princeton.edu/courses/archive/fall06/cos402/papers/chess-RL.pdf pdf] | ||
− | * [[Jonathan Baxter]], [[Peter Bartlett]] ('''2000'''). ''Reinforcement Learning on POMDPs via Direct Gradient Ascent''. [http://dblp.uni-trier.de/db/conf/icml/icml2000.html ICML 2000], [https://pdfs.semanticscholar.org/b874/98f0879d312c308889135203b17069aa0486.pdf pdf] | + | * [[Jonathan Baxter]], [[Mathematician#PBartlett|Peter Bartlett]] ('''2000'''). ''Reinforcement Learning on POMDPs via Direct Gradient Ascent''. [http://dblp.uni-trier.de/db/conf/icml/icml2000.html ICML 2000], [https://pdfs.semanticscholar.org/b874/98f0879d312c308889135203b17069aa0486.pdf pdf] |
− | * [[Jonathan Baxter]], [[Peter Bartlett]] ('''2001'''). ''Infinite-Horizon Policy-Gradient Estimation''. [https://en.wikipedia.org/wiki/Journal_of_Artificial_Intelligence_Research Journal of Artificial Intelligence Research] 15, [http://neuro.bstu.by/ai/To-dom/My_research/Papers-2.0/STDP/Reinforcement-L/A/Refs/baxter01a.pdf pdf] | + | * [[Jonathan Baxter]], [[Mathematician#PBartlett|Peter Bartlett]] ('''2001'''). ''Infinite-Horizon Policy-Gradient Estimation''. [https://en.wikipedia.org/wiki/Journal_of_Artificial_Intelligence_Research Journal of Artificial Intelligence Research] 15, [http://neuro.bstu.by/ai/To-dom/My_research/Papers-2.0/STDP/Reinforcement-L/A/Refs/baxter01a.pdf pdf] |
+ | * [[Lex Weaver]], [[Jonathan Baxter]] ('''2001'''). ''STD (λ): learning state differences with TD (λ)''. [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.20.7737 CiteSeerX] | ||
* [https://www.linkedin.com/in/dougaberdeen/ Douglas Aberdeen], [[Jonathan Baxter]] ('''2001'''). ''[https://onlinelibrary.wiley.com/doi/abs/10.1002/cpe.549 Emmerald: a fast matrix-matrix multiply using Intel's SSE instructions]''. [https://onlinelibrary.wiley.com/journal/15320634 Concurrency and Computation: Practice and Experience], Vol. 13, No. 2 | * [https://www.linkedin.com/in/dougaberdeen/ Douglas Aberdeen], [[Jonathan Baxter]] ('''2001'''). ''[https://onlinelibrary.wiley.com/doi/abs/10.1002/cpe.549 Emmerald: a fast matrix-matrix multiply using Intel's SSE instructions]''. [https://onlinelibrary.wiley.com/journal/15320634 Concurrency and Computation: Practice and Experience], Vol. 13, No. 2 | ||
==2010 ...== | ==2010 ...== | ||
− | |||
* [[Jonathan Baxter]] ('''2011'''). ''A Model of Inductive Bias Learning''. [https://arxiv.org/abs/1106.0245 arXiv:1106.0245] <ref>[https://en.wikipedia.org/wiki/Inductive_bias Inductive bias from Wikipedia]</ref> | * [[Jonathan Baxter]] ('''2011'''). ''A Model of Inductive Bias Learning''. [https://arxiv.org/abs/1106.0245 arXiv:1106.0245] <ref>[https://en.wikipedia.org/wiki/Inductive_bias Inductive bias from Wikipedia]</ref> | ||
+ | * [[Jonathan Baxter]], [[Mathematician#PBartlett|Peter Bartlett]] ('''2011'''). ''Infinite-Horizon Policy-Gradient Estimation''. [https://arxiv.org/abs/1106.0665 arXiv:1106.0665] | ||
+ | * [[Mathematician#PBartlett|Peter Bartlett]], [[Jonathan Baxter]], [[Lex Weaver]] ('''2011'''). ''Experiments with Infinite-Horizon, Policy-Gradient Estimation''. [https://arxiv.org/abs/1106.0666 arXiv:1106.0666] | ||
=Forum Posts= | =Forum Posts= | ||
Line 46: | Line 48: | ||
'''[[People|Up one level]]''' | '''[[People|Up one level]]''' | ||
+ | [[Category:Researcher|Baxter]] | ||
+ | [[Category:Chess Programmer|Baxter]] |
Latest revision as of 22:40, 23 May 2019
Home * People * Jonathan Baxter
Jonathan Baxter,
an Australian computer scientist, entrepreneur, CEO and co-founder of the Web crawling company Panscient [2] [3], and before quantitative researcher at Two Sigma Investments, a New York City based hedge fund, and distinguished research scientist at WhizBang! Labs, Pittsburgh, Pennsylvania [4] [5] [6].
Contents
Computer Chess
While affiliated with the Department of Systems Engineering at Australian National University, Jonathan Baxter co-authored the TD-learning chess program KnightCap [7] and further wrote his own engine TDChess based on KnightCap [8], which won the NC3 1999 Australasian National Computer Chess Championship [9]. After the AlphaZero incident in December 2017, Jonathan Baxter contributed a Monte-Carlo Tree Search wrapper for Stockfish [10].
Selected Publications
1997 ...
- Jonathan Baxter, Andrew Tridgell, Lex Weaver (1997). Knightcap: A chess program that learns by combining td(λ) with minimax search. 15th International Conference on Machine Learning, pdf via citeseerX
- Jonathan Baxter, Andrew Tridgell, Lex Weaver (1998). Experiments in Parameter Learning Using Temporal Differences. ICCA Journal, Vol. 21, No. 2
- Jonathan Baxter, Andrew Tridgell, Lex Weaver (1999). TDLeaf(lambda): Combining Temporal Difference Learning with Game-Tree Search. Australian Journal of Intelligent Information Processing Systems, Vol. 5 No. 1, arXiv:cs/9901001
- Jonathan Baxter, Andrew Tridgell, Lex Weaver (1999). KnightCap: A chess program that learns by combining TD(lambda) with game-tree search. arXiv:cs/9901002
2000 ...
- Jonathan Baxter, Andrew Tridgell, Lex Weaver (2000). Learning to Play Chess Using Temporal Differences. Machine Learning, Vol 40, No. 3, pdf
- Jonathan Baxter, Peter Bartlett (2000). Reinforcement Learning on POMDPs via Direct Gradient Ascent. ICML 2000, pdf
- Jonathan Baxter, Peter Bartlett (2001). Infinite-Horizon Policy-Gradient Estimation. Journal of Artificial Intelligence Research 15, pdf
- Lex Weaver, Jonathan Baxter (2001). STD (λ): learning state differences with TD (λ). CiteSeerX
- Douglas Aberdeen, Jonathan Baxter (2001). Emmerald: a fast matrix-matrix multiply using Intel's SSE instructions. Concurrency and Computation: Practice and Experience, Vol. 13, No. 2
2010 ...
- Jonathan Baxter (2011). A Model of Inductive Bias Learning. arXiv:1106.0245 [12]
- Jonathan Baxter, Peter Bartlett (2011). Infinite-Horizon Policy-Gradient Estimation. arXiv:1106.0665
- Peter Bartlett, Jonathan Baxter, Lex Weaver (2011). Experiments with Infinite-Horizon, Policy-Gradient Estimation. arXiv:1106.0666
Forum Posts
1998 ...
- Re: Paderborn '98 by Jonathan Baxter, CCC, June 25, 1998
- Parameter Tuning by Jonathan Baxter, CCC, October 01, 1998 » Automated Tuning, TD-learning
- Any docs on crafty's new TB format? by Jonathan Baxter, CCC, November 02, 1998
- DB and Singular Extensions by Jonathan Baxter, CCC, November 22, 1998 » Deep Blue, Singular Extensions
- Re: u2600 Rating List -- Feb 1 by Jonathan Baxter, CCC, February 01, 1999
2010 ...
- MCTS wrapper for StockFish by Jonathan Baxter, FishCooking, December 19, 2017 » Monte-Carlo Tree Search, Stockfish
External Links
- Jonathan Baxter | LinkedIn
- Jonathan Baxter - Google Scholar Citations
- sirjbaxter (Jonathan Baxter) · GitHub
- A machine learning system for extracting structured records from web pages and other text sources - EP 1669896 A2
References
- ↑ Jonathan Baxter | LinkedIn
- ↑ panscient.com = still bad bot
- ↑ Panscient: People search, company search, vertical search, information extraction, business intelligence, lead generation, machine learning
- ↑ Talk Description
- ↑ ICML-2002 Workshop on Development of Representations
- ↑ Inxight Acquires WhizBang! Labs, November 4, 2002
- ↑ Welcome to the KnightCap home page
- ↑ Re: u2600 Rating List -- Feb 1 by Jonathan Baxter, CCC, February 01, 1999
- ↑ Re: Australian-ch, questions ! by David Blackman, CCC, January 09, 2000
- ↑ MCTS wrapper for StockFish by Jonathan Baxter, FishCooking, December 19, 2017
- ↑ dblp: Jonathan Baxter
- ↑ Inductive bias from Wikipedia