Changes

Jump to: navigation, search

Nicol N. Schraudolph

62 bytes removed, 17:00, 11 June 2019
no edit summary
=Learning Go=
Along with [[Peter Dayan]] and his advisor, [[Terrence J. Sejnowski]], Nicol Schraudolph applied [[Temporal Difference Learning|temporal difference learning]] to an [[Evaluation|evaluation function]] in the game of [[Go]], as published in 1994 1993 <ref>[[Nicol N. Schraudolph]], [[Peter Dayan]], [[Terrence J. Sejnowski]] ('''19941993'''). ''[https://nicpapers.schraudolphnips.orgcc/bib2htmlpaper/b2hd820-temporal-difference-learning-of-position-evaluation-in-the-game-of-SchDaySej94.html go Temporal Difference Learning of Position Evaluation in the Game of Go]''. [httphttps://papers.nips.cc/book/advances-in-neural-information-processing-systems-6-1993 Advances in Neural Information Processing Systems 6NIPS 1993]</ref>, and in 2001 <ref>[[Nicol N. Schraudolph]], [[Peter Dayan]], [[Terrence J. Sejnowski]] ('''2001'''). ''[http://nic.schraudolph.org/bib2html/b2hd-SchDaySej01.html Learning to Evaluate Go Positions via Temporal Difference Methods]''. in [[Norio Baba]], [[Lakhmi C. Jain]] (eds.) ('''2001'''). ''[http://jasss.soc.surrey.ac.uk/7/1/reviews/takama.html Computational Intelligence in Games, Studies in Fuzziness and Soft Computing]''. [http://www.springer.com/economics?SGWID=1-165-6-73481-0 Physica-Verlag]</ref>.
[[FILE:gotemporaldiffeval.jpg|none|border|text-bottom|543px|link=http://nic.schraudolph.org/bib2html/b2hd-SchDaySej94.html]]
A modular network architecture that takes advantage of board symmetries,<br/>

Navigation menu