Changes

Timothy Lillicrap

26 bytes added, 20:58, 14 November 2019
no edit summary
* [[Shixiang Gu]], [[Timothy Lillicrap]], [[Ilya Sutskever]], [[Sergey Levine]] ('''2016'''). ''Continuous Deep Q-Learning with Model-based Acceleration''. [https://arxiv.org/abs/1603.00748 arXiv:1603.00748] <ref>[https://en.wikipedia.org/wiki/Q-learning Q-learning from Wikipedia]</ref>
* [[Shixiang Gu]], [[Ethan Holly]], [[Timothy Lillicrap]], [[Sergey Levine]] ('''2016'''). ''Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates''. [https://arxiv.org/abs/1610.00633 arXiv:1610.00633]
* [[Shixiang Gu]], [[Timothy Lillicrap]], [[Mathematician#ZGhahramani|Zoubin Ghahramani]], [[Richard E. Turner]], [[Sergey Levine]] ('''2016'''). ''Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic''. [https://arxiv.org/abs/1611.02247 arXiv:1611.02247]
'''2017'''
* [[Yutian Chen]], [[Matthew W. Hoffman]], [[Sergio Gomez Colmenarejo]], [[Misha Denil]], [[Timothy Lillicrap]], [[Matthew Botvinick]], [[Nando de Freitas]] ('''2017'''). ''Learning to Learn without Gradient Descent by Gradient Descent''. [https://arxiv.org/abs/1611.03824v6 arXiv:1611.03824v6], [http://dblp.uni-trier.de/db/conf/icml/icml2017.html ICML 2017]
