
Home * People * Volodymyr Mnih

Volodymyr Mnih [1]

Volodymyr Mnih, a Canadian research scientist at Google DeepMind with expertise in deep learning, heading the team working on deep Q-networks (DQN) mastering Atari games [2]. DQNs were tested on games such as Pong, Space Invaders, Breakout and Seaquest, receiving only the raw pixels and the game score as inputs. Using the same algorithm, network architecture and hyperparameters, they surpassed the performance of all previous algorithms and achieved a level comparable to that of a professional human games tester across a set of 49 games. Volodymyr Mnih holds a Ph.D. in machine learning from the University of Toronto, where he was supervised by Geoffrey E. Hinton, and a Master's degree in computing science from the University of Alberta, where his advisor was Csaba Szepesvári.
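
The core of DQN is Q-learning with a bootstrapped target computed from a periodically synchronized copy of the value network. The following minimal Python sketch is not taken from the publications below; all names, sizes and constants are illustrative assumptions, and a small tabular array stands in for DQN's convolutional network. It only shows the shape of the update rule on a batch of transitions.

import numpy as np

rng = np.random.default_rng(0)

N_STATES, N_ACTIONS = 8, 4   # toy sizes, not an Atari state space
GAMMA = 0.99                 # discount factor
LR = 0.1                     # learning rate

# A tabular Q-function stands in for the deep convolutional network of DQN.
q_online = np.zeros((N_STATES, N_ACTIONS))
q_target = q_online.copy()   # frozen copy, the "target network"

def dqn_update(batch):
    # One update on a batch of (state, action, reward, next_state, done) transitions,
    # as they would be sampled from a replay memory.
    for s, a, r, s_next, done in batch:
        # Bootstrapped target: r + gamma * max_a' Q_target(s', a'), cut off at episode end.
        target = r if done else r + GAMMA * np.max(q_target[s_next])
        q_online[s, a] += LR * (target - q_online[s, a])

# Toy usage: 32 random transitions, then synchronize the target network,
# which DQN does only every C steps to stabilize learning.
batch = [(rng.integers(N_STATES), rng.integers(N_ACTIONS), float(rng.normal()),
          rng.integers(N_STATES), False) for _ in range(32)]
dqn_update(batch)
q_target = q_online.copy()

In the actual system the Q-function is a convolutional network over stacked game frames, and the per-transition correction above becomes a gradient step on the squared temporal-difference error.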

Selected Publications [3]

2008

2010 ...

* [[Volodymyr Mnih]], [[Koray Kavukcuoglu]], [[David Silver]], [[Alex Graves]], [[Ioannis Antonoglou]], [[Daan Wierstra]], [[Martin Riedmiller]] ('''2013'''). ''Playing Atari with Deep Reinforcement Learning''. [http://arxiv.org/abs/1312.5602 arXiv:1312.5602] <ref>[http://www.nervanasys.com/demystifying-deep-reinforcement-learning/ Demystifying Deep Reinforcement Learning] by [http://www.nervanasys.com/author/tambet/ Tambet Matiisen], [http://www.nervanasys.com/ Nervana], December 21, 2015</ref>

2015 ...

* [[Volodymyr Mnih]], [[Koray Kavukcuoglu]], [[David Silver]], [[Mathematician#AARusu|Andrei A. Rusu]], [[Joel Veness]], [[Marc G. Bellemare]], [[Alex Graves]], [[Martin Riedmiller]], [[Andreas K. Fidjeland]], [[Georg Ostrovski]], [[Stig Petersen]], [[Charles Beattie]], [[Amir Sadik]], [[Ioannis Antonoglou]], [[Helen King]], [[Dharshan Kumaran]], [[Daan Wierstra]], [[Shane Legg]], [[Demis Hassabis]] ('''2015'''). ''[http://www.nature.com/nature/journal/v518/n7540/abs/nature14236.html Human-level control through deep reinforcement learning]''. [https://en.wikipedia.org/wiki/Nature_%28journal%29 Nature], Vol. 518
* [[Volodymyr Mnih]], [[Adrià Puigdomènech Badia]], [[Mehdi Mirza]], [[Alex Graves]], [[Timothy Lillicrap]], [[Tim Harley]], [[David Silver]], [[Koray Kavukcuoglu]] ('''2016'''). ''Asynchronous Methods for Deep Reinforcement Learning''. [https://arxiv.org/abs/1602.01783 arXiv:1602.01783v2]
* [[Max Jaderberg]], [[Volodymyr Mnih]], [[Wojciech Marian Czarnecki]], [[Tom Schaul]], [[Joel Z. Leibo]], [[David Silver]], [[Koray Kavukcuoglu]] ('''2016'''). ''Reinforcement Learning with Unsupervised Auxiliary Tasks''. [https://arxiv.org/abs/1611.05397v1 arXiv:1611.05397v1]

External Links

References

Up one level