Jonathan Baxter
Home * People * Jonathan Baxter
Jonathan Baxter,
an Australian computer scientist, entrepreneur, CEO and co-founder of the Web crawling company Panscient [2] [3], and before quantitative researcher at Two Sigma Investments, a New York City based hedge fund, and distinguished research scientist at WhizBang! Labs, Pittsburgh, Pennsylvania [4] [5] [6].
Contents
Computer Chess
While affiliated with the Department of Systems Engineering at Australian National University, Jonathan Baxter co-authored the TD-learning chess program KnightCap [7] and further wrote his own engine TDChess based on KnightCap [8], which won the NC3 1999 Australasian National Computer Chess Championship [9]. After the AlphaZero incident in December 2017, Jonathan Baxter contributed a Monte-Carlo Tree Search wrapper for Stockfish [10].
Selected Publications
1997 ...
- Jonathan Baxter, Andrew Tridgell, Lex Weaver (1997). Knightcap: A chess program that learns by combining td(λ) with minimax search. 15th International Conference on Machine Learning, pdf via citeseerX
- Jonathan Baxter, Andrew Tridgell, Lex Weaver (1998). Experiments in Parameter Learning Using Temporal Differences. ICCA Journal, Vol. 21, No. 2
- Jonathan Baxter, Andrew Tridgell, Lex Weaver (1999). TDLeaf(lambda): Combining Temporal Difference Learning with Game-Tree Search. Australian Journal of Intelligent Information Processing Systems, Vol. 5 No. 1, arXiv:cs/9901001
- Jonathan Baxter, Andrew Tridgell, Lex Weaver (1999). KnightCap: A chess program that learns by combining TD(lambda) with game-tree search. arXiv:cs/9901002
2000 ...
- Jonathan Baxter, Andrew Tridgell, Lex Weaver (2000). Learning to Play Chess Using Temporal Differences. Machine Learning, Vol 40, No. 3, pdf
- Jonathan Baxter, Peter Bartlett (2000). Reinforcement Learning on POMDPs via Direct Gradient Ascent. ICML 2000, pdf
- Jonathan Baxter, Peter Bartlett (2001). Infinite-Horizon Policy-Gradient Estimation. Journal of Artificial Intelligence Research 15, pdf
- Lex Weaver, Jonathan Baxter (2001). STD (λ): learning state differences with TD (λ). CiteSeerX
- Douglas Aberdeen, Jonathan Baxter (2001). Emmerald: a fast matrix-matrix multiply using Intel's SSE instructions. Concurrency and Computation: Practice and Experience, Vol. 13, No. 2
2010 ...
- Jonathan Baxter (2011). A Model of Inductive Bias Learning. arXiv:1106.0245 [12]
- Jonathan Baxter, Peter Bartlett (2011). Infinite-Horizon Policy-Gradient Estimation. arXiv:1106.0665
- Peter Bartlett, Jonathan Baxter, Lex Weaver (2011). Experiments with Infinite-Horizon, Policy-Gradient Estimation. arXiv:1106.0666
Forum Posts
1998 ...
- Re: Paderborn '98 by Jonathan Baxter, CCC, June 25, 1998
- Parameter Tuning by Jonathan Baxter, CCC, October 01, 1998 » Automated Tuning, TD-learning
- Any docs on crafty's new TB format? by Jonathan Baxter, CCC, November 02, 1998
- DB and Singular Extensions by Jonathan Baxter, CCC, November 22, 1998 » Deep Blue, Singular Extensions
- Re: u2600 Rating List -- Feb 1 by Jonathan Baxter, CCC, February 01, 1999
2010 ...
- MCTS wrapper for StockFish by Jonathan Baxter, FishCooking, December 19, 2017 » Monte-Carlo Tree Search, Stockfish
External Links
- Jonathan Baxter | LinkedIn
- Jonathan Baxter - Google Scholar Citations
- sirjbaxter (Jonathan Baxter) · GitHub
- A machine learning system for extracting structured records from web pages and other text sources - EP 1669896 A2
References
- ↑ Jonathan Baxter | LinkedIn
- ↑ panscient.com = still bad bot
- ↑ Panscient: People search, company search, vertical search, information extraction, business intelligence, lead generation, machine learning
- ↑ Talk Description
- ↑ ICML-2002 Workshop on Development of Representations
- ↑ Inxight Acquires WhizBang! Labs, November 4, 2002
- ↑ Welcome to the KnightCap home page
- ↑ Re: u2600 Rating List -- Feb 1 by Jonathan Baxter, CCC, February 01, 1999
- ↑ Re: Australian-ch, questions ! by David Blackman, CCC, January 09, 2000
- ↑ MCTS wrapper for StockFish by Jonathan Baxter, FishCooking, December 19, 2017
- ↑ dblp: Jonathan Baxter
- ↑ Inductive bias from Wikipedia