Jonathan Baxter



Jonathan Baxter is an Australian computer scientist and entrepreneur, CEO and co-founder of the Web crawling company Panscient. He was previously a quantitative researcher at Two Sigma Investments, a New York City-based hedge fund, and a distinguished research scientist at WhizBang! Labs, Pittsburgh, Pennsylvania.

=Computer Chess=

While affiliated with the Department of Systems Engineering at Australian National University, Jonathan Baxter co-authored the TD-learning chess program KnightCap and went on to write his own engine, TDChess, based on KnightCap, which won the NC3 1999 Australasian National Computer Chess Championship. After the publication of the AlphaZero preprint in December 2017, he contributed a Monte-Carlo Tree Search wrapper for Stockfish.

=Selected Publications=

1997 ...

 * Jonathan Baxter, Andrew Tridgell, Lex Weaver (1997). KnightCap: A chess program that learns by combining TD(λ) with minimax search. 15th International Conference on Machine Learning, pdf via CiteSeerX
 * Jonathan Baxter, Andrew Tridgell, Lex Weaver (1998). Experiments in Parameter Learning Using Temporal Differences. ICCA Journal, Vol. 21, No. 2
 * Jonathan Baxter, Andrew Tridgell, Lex Weaver (1999). TDLeaf(lambda): Combining Temporal Difference Learning with Game-Tree Search. Australian Journal of Intelligent Information Processing Systems, Vol. 5, No. 1, arXiv:cs/9901001
 * Jonathan Baxter, Andrew Tridgell, Lex Weaver (1999). KnightCap: A chess program that learns by combining TD(lambda) with game-tree search. arXiv:cs/9901002

2000 ...

 * Jonathan Baxter, Andrew Tridgell, Lex Weaver (2000). Learning to Play Chess Using Temporal Differences. Machine Learning, Vol. 40, No. 3, pdf
 * Jonathan Baxter, Peter Bartlett (2000). Reinforcement Learning on POMDPs via Direct Gradient Ascent. ICML 2000, pdf
 * Jonathan Baxter, Peter Bartlett (2001). Infinite-Horizon Policy-Gradient Estimation. Journal of Artificial Intelligence Research 15, pdf
 * Lex Weaver, Jonathan Baxter (2001). STD(λ): learning state differences with TD(λ). CiteSeerX
 * Douglas Aberdeen, Jonathan Baxter (2001). Emmerald: a fast matrix-matrix multiply using Intel's SSE instructions. Concurrency and Computation: Practice and Experience, Vol. 13, No. 2

2010 ...

 * Jonathan Baxter (2011). A Model of Inductive Bias Learning. arXiv:1106.0245
 * Jonathan Baxter, Peter Bartlett (2011). Infinite-Horizon Policy-Gradient Estimation. arXiv:1106.0665
 * Peter Bartlett, Jonathan Baxter, Lex Weaver (2011). Experiments with Infinite-Horizon, Policy-Gradient Estimation. arXiv:1106.0666

=Forum Posts=

1998 ...

 * Re: Paderborn '98 by Jonathan Baxter, CCC, June 25, 1998
 * Parameter Tuning by Jonathan Baxter, CCC, October 01, 1998 » Automated Tuning, TD-learning
 * Any docs on crafty's new TB format? by Jonathan Baxter, CCC, November 02, 1998
 * DB and Singular Extensions by Jonathan Baxter, CCC, November 22, 1998 » Deep Blue, Singular Extensions
 * Re: u2600 Rating List -- Feb 1 by Jonathan Baxter, CCC, February 01, 1999

2010 ...

 * MCTS wrapper for StockFish by Jonathan Baxter, FishCooking, December 19, 2017 » Monte-Carlo Tree Search, Stockfish

=External Links=
 * Jonathan Baxter | LinkedIn
 * Jonathan Baxter - Google Scholar Citations
 * sirjbaxter (Jonathan Baxter) · GitHub
 * A machine learning system for extracting structured records from web pages and other text sources - EP 1669896 A2
