Yasuhiro Osaki

From Chessprogramming wiki
Jump to: navigation, search

Home * People * Yasuhiro Osaki

Yasuhiro Osaki [1]

Yasuhiro Osaki,
a Japanese software engineer and computer scientist at Sony. Until 2010, Yasuhiro Osaki was affiliated with the laboratory of professor Yoshiyuki Kotani at the Tokyo University of Agriculture and Technology.


Yasuhiro Osaki's research was about reinforcement learning and the application of TD(λ) based on Monte-Carlo simulations in computer games. The program committee of the 12th Game Programming Workshop 2007 gave the best presentation award to Yasuhiro Osaki on TD(λ)-MC, a reinforcement learning approach with Monte-carlo simulations [2] [3].

Selected Publications


External Links


  1. YasuhiroOsaki (Yasuhiro Osaki) · GitHub
  2. Yasuhiro Osaki, Kazutomo Shibahara, Yasuhiro Tajima, Yoshiyuki Kotani (2007). Reinforcement Learning of Evaluation Functions Using Temporal Difference-Monte Carlo learning method. 12th Game Programming Workshop
  3. TD-Lamda from Wikipedia
  4. dblp: Yasuhiro Osaki

Up one level