Volodymyr Mnih
Home * People * Volodymyr Mnih
Volodymyr Mnih,
a Canadian research scientist at at Google DeepMind with expertise in deep learning, heading the team working on deep Q-networks (DQN) mastering Atari games [2]. DQNs were tested with games such as Pong, Space Invaders, Breakout and Seaquest, receiving only the pixels and the game score as inputs, to surpass the performance of all previous algorithms and achieve a level comparable to that of a professional human games tester across a set of 49 games, using the same algorithm, network architecture and hyperparameters. Volodymyr Mnih holds a Ph.D. in machine learning from University of Toronto under supervision of Geoffrey E. Hinton, and a Master's degree in computing science fro University of Alberta where his advisor was Csaba Szepesvári.
Selected Publications
2008
- Volodymyr Mnih (2008). Efficient Stopping Rules. Masters thesis, University of Alberta, advisor: Csaba Szepesvári, pdf
2010 ...
- Volodymyr Mnih, Geoffrey E. Hinton (2010). Learning to Detect Roads in High-Resolution Aerial Images. ECCV 2010
- Volodymyr Mnih (2013). Machine Learning for Aerial Image Labeling. Ph.D. thesis, University of Toronto, advisor Geoffrey E. Hinton, pdf
- Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller (2013). Playing Atari with Deep Reinforcement Learning. arXiv:1312.5602 [4]
2015 ...
- Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, Demis Hassabis (2015). Human-level control through deep reinforcement learning. Nature, Vol. 518
- Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy Lillicrap, Tim Harley, David Silver, Koray Kavukcuoglu (2016). Asynchronous Methods for Deep Reinforcement Learning. arXiv:1602.01783v2
- Max Jaderberg, Volodymyr Mnih, Wojciech Marian Czarnecki, Tom Schaul, Joel Z. Leibo, David Silver, Koray Kavukcuoglu (2016). Reinforcement Learning with Unsupervised Auxiliary Tasks. arXiv:1611.05397v1
- Hado van Hasselt, Arthur Guez, Matteo Hessel, Volodymyr Mnih, David Silver (2016). Learning values across many orders of magnitude. arXiv:1602.07714v2, NIPS 2016
External Links
References
- ↑ Vlad Mnih - Homepage
- ↑ We’ll Never Win! Google’s AI Plays Atari - Propecta by Nate Dame, March 19, 2015
- ↑ dblp: Volodymyr Mnih
- ↑ Demystifying Deep Reinforcement Learning by Tambet Matiisen, Nervana, December 21, 2015