Difference between revisions of "Nicol N. Schraudolph"

From Chessprogramming wiki
Line 10: Line 10:
  
 
=Learning Go=  
 
Along with [[Peter Dayan]] and his advisor, [[Terrence J. Sejnowski]], Nicol Schraudolph applied [[Temporal Difference Learning|temporal difference learning]] to an [[Evaluation|evaluation function]] in the game of [[Go]], as published in 1994 <ref>[[Nicol N. Schraudolph]], [[Peter Dayan]], [[Terrence J. Sejnowski]] ('''1994'''). ''[https://nic.schraudolph.org/bib2html/b2hd-SchDaySej94.html Temporal Difference Learning of Position Evaluation in the Game of Go]''. [http://papers.nips.cc/book/advances-in-neural-information-processing-systems-6-1993 Advances in Neural Information Processing Systems 6]</ref>, and in 2001 <ref>[[Nicol N. Schraudolph]], [[Peter Dayan]], [[Terrence J. Sejnowski]] ('''2001'''). ''[http://nic.schraudolph.org/bib2html/b2hd-SchDaySej01.html Learning to Evaluate Go Positions via Temporal Difference Methods]''. in  [[Norio Baba]], [[Lakhmi C. Jain]] (eds.) ('''2001'''). ''[http://jasss.soc.surrey.ac.uk/7/1/reviews/takama.html Computational Intelligence in Games, Studies in Fuzziness and Soft Computing]''. [http://www.springer.com/economics?SGWID=1-165-6-73481-0 Physica-Verlag]</ref>.
+
Along with [[Peter Dayan]] and his advisor, [[Terrence J. Sejnowski]], Nicol Schraudolph applied [[Temporal Difference Learning|temporal difference learning]] to an [[Evaluation|evaluation function]] in the game of [[Go]], as published in 1993 <ref>[[Nicol N. Schraudolph]], [[Peter Dayan]], [[Terrence J. Sejnowski]] ('''1993'''). ''[https://papers.nips.cc/paper/820-temporal-difference-learning-of-position-evaluation-in-the-game-of-go Temporal Difference Learning of Position Evaluation in the Game of Go]''. [https://papers.nips.cc/book/advances-in-neural-information-processing-systems-6-1993 NIPS 1993]</ref>, and in 2001 <ref>[[Nicol N. Schraudolph]], [[Peter Dayan]], [[Terrence J. Sejnowski]] ('''2001'''). ''[http://nic.schraudolph.org/bib2html/b2hd-SchDaySej01.html Learning to Evaluate Go Positions via Temporal Difference Methods]''. [http://jasss.soc.surrey.ac.uk/7/1/reviews/takama.html Computational Intelligence in Games, Studies in Fuzziness and Soft Computing]. [http://www.springer.com/economics?SGWID=1-165-6-73481-0 Physica-Verlag]</ref>.
 
[[FILE:gotemporaldiffeval.jpg|none|border|text-bottom|543px|link=http://nic.schraudolph.org/bib2html/b2hd-SchDaySej94.html]]  
 
 
A modular network architecture that takes advantage of board symmetries,<br/>
 

Revision as of 17:00, 11 June 2019


Nic Schraudolph [1]

Nicol N. (Nic) Schraudolph is a German computer scientist, independent researcher and consultant in machine learning and optimization, in particular stochastic gradient descent. He received a B.Sc. degree in computer science from the University of Essex in 1988, and M.Sc. and Ph.D. degrees in computer science and cognitive science from the University of California, San Diego. His 1995 Ph.D. thesis, supervised by Terrence J. Sejnowski, dealt with the optimization of entropy with neural networks [2]. His previous research affiliations include NICTA, Canberra; ETH Zurich; and the Dalle Molle Institute for Artificial Intelligence Research, Lugano.

Learning Go

Nicol Schraudolph, together with Peter Dayan and his own Ph.D. advisor Terrence J. Sejnowski, applied temporal difference learning to an evaluation function in the game of Go, as published in 1993 [3] and in 2001 [4].

Gotemporaldiffeval.jpg

A modular network architecture that takes advantage of board symmetries,
translation invariance and localized reinforcement to evaluate Go positions [5]
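
The general idea of temporal difference learning applied to an evaluation function can be illustrated with a minimal sketch. It is not the network or the training procedure from the cited papers: it assumes a plain linear evaluator and the simple TD(0) rule, and the names `evaluate`, `td0_update` and `train_episode`, as well as the feature vectors, are hypothetical and chosen only for illustration.

```python
def evaluate(weights, features):
    """Linear evaluation (an assumption): dot product of weights and features."""
    return sum(w * f for w, f in zip(weights, features))


def td0_update(weights, features, next_value, reward, alpha=0.1, gamma=1.0):
    """One TD(0) step: move V(s) toward reward + gamma * V(s')."""
    td_error = reward + gamma * next_value - evaluate(weights, features)
    # For a linear evaluator, the gradient of V(s) with respect to the
    # weights is simply the feature vector of the position.
    return [w + alpha * td_error * f for w, f in zip(weights, features)]


def train_episode(weights, positions, final_reward, alpha=0.1):
    """Apply TD(0) along one game, given feature vectors of successive positions.

    Intermediate positions receive no reward; only the last position is
    trained toward the final outcome of the game.
    """
    for t, features in enumerate(positions):
        if t + 1 < len(positions):
            next_value = evaluate(weights, positions[t + 1])
            reward = 0.0
        else:
            next_value = 0.0  # terminal position: no successor value
            reward = final_reward
        weights = td0_update(weights, features, next_value, reward, alpha)
    return weights
```

Calling `train_episode` repeatedly on recorded games lets the outcome of a game propagate backwards: the final position is pulled toward the result first, and earlier positions are then pulled toward the values of their successors.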

Selected Publications

[6] [7] [8]

1991 ...

2000 ...

2010 ...

External Links

References
