Difference between revisions of "NeuroChess"

From Chessprogramming wiki
Jump to: navigation, search
Line 23: Line 23:
 
=Publications=  
 
=Publications=  
 
* [[Sebastian Thrun]] ('''1995'''). ''[http://robots.stanford.edu/papers/thrun.nips7.neuro-chess.html Learning to Play the Game of Chess]''. in [[Gerald Tesauro]], [https://en.wikipedia.org/wiki/David_S._Touretzky David S. Touretzky], [http://mitpress.mit.edu/authors/todd-k-leen Todd K. Leen] (eds.) Advances in Neural Information Processing Systems 7, [https://en.wikipedia.org/wiki/MIT_Press MIT Press], [http://papers.nips.cc/paper/1007-learning-to-play-the-game-of-chess.pdf pdf]
 
* [[Sebastian Thrun]] ('''1995'''). ''[http://robots.stanford.edu/papers/thrun.nips7.neuro-chess.html Learning to Play the Game of Chess]''. in [[Gerald Tesauro]], [https://en.wikipedia.org/wiki/David_S._Touretzky David S. Touretzky], [http://mitpress.mit.edu/authors/todd-k-leen Todd K. Leen] (eds.) Advances in Neural Information Processing Systems 7, [https://en.wikipedia.org/wiki/MIT_Press MIT Press], [http://papers.nips.cc/paper/1007-learning-to-play-the-game-of-chess.pdf pdf]
* [[Johannes Fürnkranz]], [[Miroslav Kubat]] ('''2001'''). ''[https://www.novapublishers.com/catalog/product_info.php?products_id=720 Machines that Learn to Play Games]''. Advances in Computation: Theory and Practice, Vol. 8,. [https://en.wikipedia.org/wiki/Nova_Publishers NOVA Science Publishers]
+
* [[Johannes Fürnkranz]], [[Miroslav Kubat]] (eds.) ('''2001'''). ''[http://www.novapublishers.org/catalog/product_info.php?products_id=720 Machines that Learn to Play Games]''. Advances in Computation: Theory and Practice, Vol. 8,. [https://en.wikipedia.org/wiki/Nova_Publishers NOVA Science Publishers]
* [[Jacek Mańdziuk]] ('''2007'''). ''[http://link.springer.com/chapter/10.1007/978-3-540-71984-7_15 Computational Intelligence in Mind Games]''. in [[Włodzisław Duch]], [[Jacek Mańdziuk]] (eds.) ''[http://link.springer.com/book/10.1007%2F978-3-540-71984-7 Challenges for Computational Intelligence]''. [http://link.springer.com/bookseries/7092 Studies in Computational Intelligence], Vol. 63, [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer]
+
* [[Jacek Mańdziuk]] ('''2007'''). ''[https://link.springer.com/chapter/10.1007/978-3-540-71984-7_15 Computational Intelligence in Mind Games]''. in [[Włodzisław Duch]], [[Jacek Mańdziuk]] (eds.) ''[https://link.springer.com/book/10.1007%2F978-3-540-71984-7 Challenges for Computational Intelligence]''. [https://link.springer.com/bookseries/7092 Studies in Computational Intelligence], Vol. 63, [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer]
 
* [[Marco Block-Berlitz|Marco Block]], Maro Bader, [http://page.mi.fu-berlin.de/tapia/ Ernesto Tapia], Marte Ramírez, Ketill Gunnarsson, Erik Cuevas, Daniel Zaldivar, [[Raúl Rojas]] ('''2008'''). ''Using Reinforcement Learning in Chess Engines''. Concibe Science 2008, [http://www.micai.org/rcs/ Research in Computing Science]: Special Issue in Electronics and Biomedical Engineering, Computer Science and Informatics, Vol. 35, [http://page.mi.fu-berlin.de/block/concibe2008.pdf pdf]
 
* [[Marco Block-Berlitz|Marco Block]], Maro Bader, [http://page.mi.fu-berlin.de/tapia/ Ernesto Tapia], Marte Ramírez, Ketill Gunnarsson, Erik Cuevas, Daniel Zaldivar, [[Raúl Rojas]] ('''2008'''). ''Using Reinforcement Learning in Chess Engines''. Concibe Science 2008, [http://www.micai.org/rcs/ Research in Computing Science]: Special Issue in Electronics and Biomedical Engineering, Computer Science and Informatics, Vol. 35, [http://page.mi.fu-berlin.de/block/concibe2008.pdf pdf]
* [[Jacek Mańdziuk]] ('''2010'''). ''[http://link.springer.com/book/10.1007%2F978-3-642-11678-0 Knowledge-Free and Learning-Based Methods in Intelligent Game Playing]''. [http://link.springer.com/bookseries/7092 Studies in Computational Intelligence], Vol. 276, [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer], pp. 124
+
* [[Jacek Mańdziuk]] ('''2010'''). ''[https://link.springer.com/book/10.1007%2F978-3-642-11678-0 Knowledge-Free and Learning-Based Methods in Intelligent Game Playing]''. [http://link.springer.com/bookseries/7092 Studies in Computational Intelligence], Vol. 276, [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer], pp. 124
* [[István Szita]] ('''2012'''). ''[http://link.springer.com/chapter/10.1007%2F978-3-642-27645-3_17 Reinforcement Learning in Games]''. in [[Marco Wiering]], [http://martijnvanotterlo.nl/ Martijn Van Otterlo] (eds.). ''[https://scholar.google.com/citations?view_op=view_citation&hl=en&user=xVas0I8AAAAJ&citation_for_view=xVas0I8AAAAJ:abG-DnoFyZgC Reinforcement learning: State-of-the-art]''. [http://link.springer.com/book/10.1007/978-3-642-27645-3 Adaptation, Learning, and Optimization, Vol. 12], [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer]
+
* [[István Szita]] ('''2012'''). ''[https://link.springer.com/chapter/10.1007%2F978-3-642-27645-3_17 Reinforcement Learning in Games]''. in [[Marco Wiering]], [http://martijnvanotterlo.nl/ Martijn Van Otterlo] (eds.). ''[https://scholar.google.com/citations?view_op=view_citation&hl=en&user=xVas0I8AAAAJ&citation_for_view=xVas0I8AAAAJ:abG-DnoFyZgC Reinforcement learning: State-of-the-art]''. [https://link.springer.com/book/10.1007/978-3-642-27645-3 Adaptation, Learning, and Optimization, Vol. 12], [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer]
  
 
=Forum Posts=  
 
=Forum Posts=  

Revision as of 15:04, 4 March 2019

Home * Engines * NeuroChess

NeuroChess,
an experimental program which learns to play chess from the final outcome of games written by Sebastian Thrun in the early mid 90s, at that time at Carnegie Mellon University supported by Tom Mitchell and in particular on the domain of chess advised by Hans Berliner [1] . NeuroChess took its search algorithm from GNU Chess but its evaluation was based on neural networks to integrate inductive neural network learning, temporal difference learning, and a variant of explanation-based learning [2] [3] .

Architecture

Using the raw board representaion as input representation of a neural network is a poor choice, since small changes on the board can cause huge differences in value contrasting the smooth nature of neural network representations. Therefore NeuroChess maps its internal board into a set of 175 carefully designed features. A specially trained, but otherwise conventional three layer feed forward neural network (V) with one final output maps the feature vector to an evaluation score.

The evaluation network V was trained by backpropagation and the temporal difference method (TD(0)), utilizing a second neural network, an explanation based neural network (EBNN) dubbed the chess model M, which maps 175 inputs via 165 hidden units to 175 outputs, and aims to predict the (important) board features two plies ahead. This network M is trained independently prior to V to incorporate supervised domain knowledge.

NeuroChess.jpg

Learning an evaluation function in NeuroChess. Boards are mapped into a high-dimensional Feature vector, which forms the input for both the evaluation network V and the chess model M. The evaluation network is trained by Backpropagation and the TD(O) procedure. Both networks are employed for analyzing training example in order to derive target slopes for V. [4]

Conclusion

After training M with 120,000 grandmaster games, and training V with a further 2400 games, NeuroChess managed to beat GNU Chess in about 13% of the time at fixed depth 2 games, but only in 10% without EBNN. Further, computing a large neural network function took two orders of magnitude longer than evaluating the linear evaluation function of GNU Chess.

See also

Publications

Forum Posts

External Links

References

  1. Sebastian Thrun (1995). Learning to Play the Game of Chess. Acknowledgment
  2. Sebastian Thrun, Tom Mitchell (1993). Integrating Inductive Neural Network Learning and Explanation-Based Learning. IJCAI 1993, zipped ps
  3. Sebastian Thrun (1995). Explanation-Based Neural Network Learning - A Lifelong Learning Approach. Ph.D. thesis, University of Bonn
  4. Figure 2 from Sebastian Thrun (1995). Learning to Play the Game of Chess. pdf

Up one level