Yasuhiro Osaki

2,776 bytes added, 13:09, 7 March 2019
'''[[Main Page|Home]] * [[People]] * Yasuhiro Osaki'''

[[FILE:YasuhiroOsaki.png|border|right|thumb|link=https://github.com/YasuhiroOsaki/| Yasuhiro Osaki <ref>[https://github.com/YasuhiroOsaki YasuhiroOsaki (Yasuhiro Osaki) · GitHub]</ref> ]]

'''Yasuhiro Osaki''',<br/>
a Japanese software engineer and computer scientist at [https://en.wikipedia.org/wiki/Sony Sony]. Until 2010, Yasuhiro Osaki was affiliated with the laboratory of professor [[Yoshiyuki Kotani]] at the [https://en.wikipedia.org/wiki/Tokyo_University_of_Agriculture_and_Technology Tokyo University of Agriculture and Technology].

=TD(λ)-MC=
Yasuhiro Osaki's research focused on [[Reinforcement Learning|reinforcement learning]] and the application of [[Temporal Difference Learning|TD(λ)]] based on [https://en.wikipedia.org/wiki/Monte_Carlo_method Monte-Carlo simulations] in computer games. The program committee of the [[Conferences#GPW|12th Game Programming Workshop 2007]] gave the best presentation award to Yasuhiro Osaki for his talk on '''TD(λ)-MC''', a reinforcement learning approach with Monte-Carlo simulations <ref>[[Yasuhiro Osaki]], [[Kazutomo Shibahara]], [[Yasuhiro Tajima]], [[Yoshiyuki Kotani]] ('''2007'''). ''Reinforcement Learning of Evaluation Functions Using Temporal Difference-Monte Carlo learning method''. [[Conferences#GPW|12th Game Programming Workshop]], [http://www.tuat.ac.jp/~kotani/index.php?plugin=attach&pcmd=open&file=osaki0711TDMC-GPW%29.pdf&refer=lab%2Fpapers%2Fdepot pdf] (Japanese)</ref> <ref>[https://en.wikipedia.org/wiki/Temporal_difference_learning#Mathematical_formulation TD-Lambda from Wikipedia]</ref>.
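
As a rough illustration of the ingredients named above, the following Python sketch shows a generic linear TD(λ) update with accumulating eligibility traces, plus a helper that estimates a win rate from random playouts. It is not taken from the cited papers: the function names <code>mc_win_rate</code> and <code>td_lambda_episode</code>, the <code>playout</code> callable, and the idea of feeding playout win rates in as per-step rewards are all assumptions made for this example.

```python
import random


def mc_win_rate(position, playout, n_sims=100, seed=0):
    """Estimate a win rate by averaging playout results.

    `playout(position, rng)` is a hypothetical callable that plays the
    position to the end and returns 1 for a win and 0 otherwise.
    """
    rng = random.Random(seed)
    wins = sum(playout(position, rng) for _ in range(n_sims))
    return wins / n_sims


def dot(w, x):
    """Inner product of two equally long sequences of floats."""
    return sum(wi * xi for wi, xi in zip(w, x))


def td_lambda_episode(features, rewards, weights,
                      alpha=0.1, gamma=1.0, lam=0.7):
    """One episode of linear TD(lambda) with accumulating traces.

    features: feature vectors x_0 .. x_T; x_T is the terminal state,
              whose value is treated as 0.
    rewards:  rewards[t] is received on the transition t -> t+1.
              (Assumption: these could be playout win-rate estimates.)
    Returns the updated weight vector; the input list is not modified.
    """
    w = list(weights)
    trace = [0.0] * len(w)
    last = len(features) - 1
    for t in range(len(rewards)):
        x = features[t]
        v = dot(w, x)
        v_next = 0.0 if t + 1 == last else dot(w, features[t + 1])
        delta = rewards[t] + gamma * v_next - v          # TD error
        trace = [gamma * lam * e + xi for e, xi in zip(trace, x)]
        w = [wi + alpha * delta * e for wi, e in zip(w, trace)]
    return w
```

With <code>lam=1.0</code> a final reward is credited to every earlier state in the episode, while <code>lam=0.0</code> updates only the state immediately preceding it. Intermediate values interpolate between these two extremes.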

=Selected Publications=
<ref>[https://dblp.uni-trier.de/pers/hd/o/Osaki:Yasuhiro dblp: Yasuhiro Osaki]</ref>
* [[Yasuhiro Osaki]], [[Kazutomo Shibahara]], [[Yasuhiro Tajima]], [[Yoshiyuki Kotani]] ('''2007'''). ''Reinforcement Learning of Evaluation Functions Using Temporal Difference-Monte Carlo learning method''. [[Conferences#GPW|12th Game Programming Workshop]]
* [[Yasuhiro Osaki]], [[Kazutomo Shibahara]], [[Yasuhiro Tajima]], [[Yoshiyuki Kotani]] ('''2008'''). ''An Othello Evaluation Function Based on Temporal Difference Learning using Probability of Winning''. [http://www.csse.uwa.edu.au/cig08/Proceedings/toc.html CIG'08], [http://www.csse.uwa.edu.au/cig08/Proceedings/papers/8010.pdf pdf]
* [[Yasuhiro Osaki]], [[Yoshiyuki Kotani]] ('''2009'''). ''A Learning Method of Evaluation Function Based on Selective Simulations''. [[Conferences#GPW|14th Game Programming Workshop]]

=External Links=
* [https://www.linkedin.com/in/yasuhiro-osaki-769025b2/ Yasuhiro Osaki | LinkedIn]
* [https://github.com/YasuhiroOsaki YasuhiroOsaki (Yasuhiro Osaki) · GitHub]

=References=
<references />
'''[[People|Up one level]]'''
[[Category:Researcher|Osaki]]