Latest revision as of 22:40, 23 May 2019

Home * People * Jonathan Baxter

Jonathan Baxter ^[1]

Jonathan Baxter,
an Australian computer scientist, entrepreneur, CEO and co-founder of the Web crawling company Panscient ^[2] ^[3], and before quantitative researcher at Two Sigma Investments, a New York City based hedge fund, and distinguished research scientist at WhizBang! Labs, Pittsburgh, Pennsylvania ^[4] ^[5] ^[6].

Computer Chess

While affiliated with the Department of Systems Engineering at Australian National University, Jonathan Baxter co-authored the TD-learning chess program KnightCap ^[7] and further wrote his own engine TDChess based on KnightCap ^[8], which won the NC3 1999 Australasian National Computer Chess Championship ^[9]. After the AlphaZero incident in December 2017, Jonathan Baxter contributed a Monte-Carlo Tree Search wrapper for Stockfish ^[10].

Selected Publications

^[11]

1997 ...

Jonathan Baxter, Andrew Tridgell, Lex Weaver (1997). Knightcap: A chess program that learns by combining td(λ) with minimax search. 15th International Conference on Machine Learning, pdf via citeseerX
Jonathan Baxter, Andrew Tridgell, Lex Weaver (1998). Experiments in Parameter Learning Using Temporal Differences. ICCA Journal, Vol. 21, No. 2
Jonathan Baxter, Andrew Tridgell, Lex Weaver (1999). TDLeaf(lambda): Combining Temporal Difference Learning with Game-Tree Search. Australian Journal of Intelligent Information Processing Systems, Vol. 5 No. 1, arXiv:cs/9901001
Jonathan Baxter, Andrew Tridgell, Lex Weaver (1999). KnightCap: A chess program that learns by combining TD(lambda) with game-tree search. arXiv:cs/9901002

2000 ...

Jonathan Baxter, Andrew Tridgell, Lex Weaver (2000). Learning to Play Chess Using Temporal Differences. Machine Learning, Vol 40, No. 3, pdf
Jonathan Baxter, Peter Bartlett (2000). Reinforcement Learning on POMDPs via Direct Gradient Ascent. ICML 2000, pdf
Jonathan Baxter, Peter Bartlett (2001). Infinite-Horizon Policy-Gradient Estimation. Journal of Artificial Intelligence Research 15, pdf
Lex Weaver, Jonathan Baxter (2001). STD (λ): learning state differences with TD (λ). CiteSeerX
Douglas Aberdeen, Jonathan Baxter (2001). Emmerald: a fast matrix-matrix multiply using Intel's SSE instructions. Concurrency and Computation: Practice and Experience, Vol. 13, No. 2

2010 ...

Jonathan Baxter (2011). A Model of Inductive Bias Learning. arXiv:1106.0245 ^[12]
Jonathan Baxter, Peter Bartlett (2011). Infinite-Horizon Policy-Gradient Estimation. arXiv:1106.0665
Peter Bartlett, Jonathan Baxter, Lex Weaver (2011). Experiments with Infinite-Horizon, Policy-Gradient Estimation. arXiv:1106.0666

Forum Posts

1998 ...

Re: Paderborn '98 by Jonathan Baxter, CCC, June 25, 1998
Parameter Tuning by Jonathan Baxter, CCC, October 01, 1998 » Automated Tuning, TD-learning
Any docs on crafty's new TB format? by Jonathan Baxter, CCC, November 02, 1998
DB and Singular Extensions by Jonathan Baxter, CCC, November 22, 1998 » Deep Blue, Singular Extensions
Re: u2600 Rating List -- Feb 1 by Jonathan Baxter, CCC, February 01, 1999

2010 ...

MCTS wrapper for StockFish by Jonathan Baxter, FishCooking, December 19, 2017 » Monte-Carlo Tree Search, Stockfish

External Links

References

Up one level

[1] Jonathan Baxter | LinkedIn

[2] scient.com = still bad bot

[3] Panscient: People search, company search, vertical search, information extraction, business intelligence, lead generation, machine learning

[4] Talk Description

[5] ICML-2002 Workshop on Development of Representations

[6] Inxight Acquires WhizBang! Labs, November 4, 2002

[7] Welcome to the KnightCap home page

[8] Re: u2600 Rating List -- Feb 1 by Jonathan Baxter, CCC, February 01, 1999

[9] Re: Australian-ch, questions ! by David Blackman, CCC, January 09, 2000

[10] MCTS wrapper for StockFish by Jonathan Baxter, FishCooking, December 19, 2017

[11] : Jonathan Baxter

[12] Inductive bias from Wikipedia

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

@@ Line 19: / Line 19: @@
 ==2000 ...==
 * [[Jonathan Baxter]], [[Andrew Tridgell]], [[Lex Weaver]] ('''2000'''). ''Learning to Play Chess Using Temporal Differences''. [http://www.dblp.org/db/journals/ml/ml40.html#BaxterTW00 Machine Learning, Vol 40, No. 3], [http://www.cs.princeton.edu/courses/archive/fall06/cos402/papers/chess-RL.pdf pdf]
-* [[Jonathan Baxter]], [[Peter Bartlett]] ('''2000'''). ''Reinforcement Learning on POMDPs via Direct Gradient Ascent''. [http://dblp.uni-trier.de/db/conf/icml/icml2000.html ICML 2000], [https://pdfs.semanticscholar.org/b874/98f0879d312c308889135203b17069aa0486.pdf pdf]
+* [[Jonathan Baxter]], [[Mathematician#PBartlett|Peter Bartlett]] ('''2000'''). ''Reinforcement Learning on POMDPs via Direct Gradient Ascent''. [http://dblp.uni-trier.de/db/conf/icml/icml2000.html ICML 2000], [https://pdfs.semanticscholar.org/b874/98f0879d312c308889135203b17069aa0486.pdf pdf]
-* [[Jonathan Baxter]], [[Peter Bartlett]] ('''2001'''). ''Infinite-Horizon Policy-Gradient Estimation''. [https://en.wikipedia.org/wiki/Journal_of_Artificial_Intelligence_Research Journal of Artificial Intelligence Research] 15, [http://neuro.bstu.by/ai/To-dom/My_research/Papers-2.0/STDP/Reinforcement-L/A/Refs/baxter01a.pdf pdf]
+* [[Jonathan Baxter]], [[Mathematician#PBartlett|Peter Bartlett]] ('''2001'''). ''Infinite-Horizon Policy-Gradient Estimation''. [https://en.wikipedia.org/wiki/Journal_of_Artificial_Intelligence_Research Journal of Artificial Intelligence Research] 15, [http://neuro.bstu.by/ai/To-dom/My_research/Papers-2.0/STDP/Reinforcement-L/A/Refs/baxter01a.pdf pdf]
+* [[Lex Weaver]], [[Jonathan Baxter]] ('''2001'''). ''STD (λ): learning state differences with TD (λ)''. [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.20.7737 CiteSeerX]
 * [https://www.linkedin.com/in/dougaberdeen/ Douglas Aberdeen], [[Jonathan Baxter]] ('''2001'''). ''[https://onlinelibrary.wiley.com/doi/abs/10.1002/cpe.549 Emmerald: a fast matrix-matrix multiply using Intel's SSE instructions]''. [https://onlinelibrary.wiley.com/journal/15320634 Concurrency and Computation: Practice and Experience], Vol. 13, No. 2
 ==2010 ...==
-* [[Jonathan Baxter]], [[Peter Bartlett]] ('''2011'''). ''Infinite-Horizon Policy-Gradient Estimation''. [https://arxiv.org/abs/1106.0665 arXiv:1106.0665]
 * [[Jonathan Baxter]] ('''2011'''). ''A Model of Inductive Bias Learning''. [https://arxiv.org/abs/1106.0245 arXiv:1106.0245] <ref>[https://en.wikipedia.org/wiki/Inductive_bias Inductive bias from Wikipedia]</ref>
+* [[Jonathan Baxter]], [[Mathematician#PBartlett|Peter Bartlett]] ('''2011'''). ''Infinite-Horizon Policy-Gradient Estimation''. [https://arxiv.org/abs/1106.0665 arXiv:1106.0665]
+* [[Mathematician#PBartlett|Peter Bartlett]], [[Jonathan Baxter]], [[Lex Weaver]] ('''2011'''). ''Experiments with Infinite-Horizon, Policy-Gradient Estimation''. [https://arxiv.org/abs/1106.0666 arXiv:1106.0666]
 =Forum Posts=
@@ Line 46: / Line 48: @@
 '''[[People|Up one level]]'''
+[[Category:Researcher|Baxter]]
+[[Category:Chess Programmer|Baxter]]

Difference between revisions of "Jonathan Baxter"

Latest revision as of 22:40, 23 May 2019

Contents

Computer Chess

Selected Publications

1997 ...

2000 ...

2010 ...

Forum Posts

1998 ...

2010 ...

External Links

References

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools