Changes

Newer edit →

Lex Weaver

3,424 bytes added, 13:29, 23 June 2018

Created page with "'''Home * People * Lex Weaver''' '''Lex Weaver''',<br/> an Australian computer scientist, since 2004 manager at the [https://en.wikipedia.org/wiki/Governme..."

'''[[Main Page|Home]] * [[People]] * Lex Weaver'''

'''Lex Weaver''',<br/>
an Australian computer scientist, since 2004 manager at the [https://en.wikipedia.org/wiki/Government_of_Australia Australian Government], and before lecturer in the Department of Computer Science at the [[Australian National University]], [https://en.wikipedia.org/wiki/Canberra Canberra], Australia, where he already defended his B.Sc. and Ph.D. degrees in 1994 and 2003 respectively <ref>[https://www.linkedin.com/in/lex-weaver-6554bba9/ Lex Weaver | LinkedIn]</ref>. Lex Weaver researched on [[Learning|machine learning]] and in particular [[Temporal Difference Learning]], and co-authored of the chess program [[KnightCap]] along with [[Jonathan Baxter]] and [[Andrew Tridgell]] <ref>[http://samba.org/KnightCap/ Welcome to the KnightCap home page]</ref>.

=Selected Publications=
<ref>[https://dblp.uni-trier.de/pers/hd/w/Weaver:Lex dblp: Lex Weaver]</ref>
==1997 ...==
* [[Jonathan Baxter]], [[Andrew Tridgell]], [[Lex Weaver]] ('''1997'''). ''Knightcap: A chess program that learns by combining td(λ) with minimax search''. 15th International Conference on Machine Learning, [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.54.8263&rep=rep1&type=pdf pdf] via [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.54.8263 citeseerX]
* [[Jonathan Baxter]], [[Andrew Tridgell]], [[Lex Weaver]] ('''1998'''). ''Experiments in Parameter Learning Using Temporal Differences''. [[ICGA Journal#21_2|ICCA Journal, Vol. 21, No. 2]]
* [[Jonathan Baxter]], [[Andrew Tridgell]], [[Lex Weaver]] ('''1999'''). ''TDLeaf(lambda): Combining Temporal Difference Learning with Game-Tree Search''. [https://www.chatbots.org/journal/australian_journal_of_intelligent_information_processing_systems/ Australian Journal of Intelligent Information Processing Systems], Vol. 5 No. 1, [http://arxiv.org/abs/cs/9901001 arXiv:cs/9901001]
* [[Jonathan Baxter]], [[Andrew Tridgell]], [[Lex Weaver]] ('''1999'''). ''KnightCap: A chess program that learns by combining TD(lambda) with game-tree search''. [https://arxiv.org/abs/cs/9901002 arXiv:cs/9901002]
==2000 ...==
* [[Jonathan Baxter]], [[Andrew Tridgell]], [[Lex Weaver]] ('''2000'''). ''Learning to Play Chess Using Temporal Differences''. [http://www.dblp.org/db/journals/ml/ml40.html#BaxterTW00 Machine Learning, Vol 40, No. 3], [http://www.cs.princeton.edu/courses/archive/fall06/cos402/papers/chess-RL.pdf pdf]
* [[Lex Weaver]], [[Jonathan Baxter]] ('''2001'''). ''STD (λ): learning state differences with TD (λ)''. [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.20.7737 CiteSeerX]
==2010 ...==
* [[Peter Bartlett]], [[Jonathan Baxter]], [[Lex Weaver]] ('''2011'''). ''Experiments with Infinite-Horizon, Policy-Gradient Estimation''. [https://arxiv.org/abs/1106.0666 arXiv:1106.0666]
* [[Lex Weaver]], [https://dblp.uni-trier.de/pers/hd/t/Tao:Nigel Nigel Tao] ('''2013'''). ''The Optimal Reward Baseline for Gradient-Based Reinforcement Learning''. [https://arxiv.org/abs/1301.2315 arXiv:1301.2315]

=External Links=
* [https://www.linkedin.com/in/lex-weaver-6554bba9/ Lex Weaver | LinkedIn]
* [https://prabook.com/web/lex.weaver/226083 ex Weaver (born June 7, 1970), Australian educator, Computer scientist | Prabook]
* [https://scholar.google.com.au/citations?user=3DPKgJ4AAAAJ&hl=en Lex Weaver - Google Scholar Citations]

=References=
<references />

'''[[People|Up one level]]'''

GerdIsenberg

Bureaucrats, Administrators

25,161

edits

Changes

Lex Weaver

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools