Search results

Jump to: navigation, search
  • ...and the action-values are represented discretely <ref>[[Chris Watkins]], [[Peter Dayan]] ('''1992'''). ''[http://www.gatsby.ucl.ac.uk/~dayan/papers/wd92.htm * [[Peter Dayan]] ('''1991'''). ''[https://www.era.lib.ed.ac.uk/handle/1842/14754 Rei
    54 KB (7,025 words) - 12:47, 14 March 2022
  • ...ttps://scholar.google.com/citations?user=W2DsnAkAAAAJ&hl=en Chris Dyer], [[Peter Dayan]], [[Timothy Lillicrap]] ('''2018'''). ''Fast Parametric Learning wit ..., [[Razvan Pascanu]], [[Matthew Botvinick]], [[Oriol Vinyals]], [[Peter W. Battaglia]] ('''2018'''). ''Relational Deep Reinforcement Learning''. [https://arxiv.
    10 KB (1,216 words) - 08:53, 17 April 2021
  • ..., [[Razvan Pascanu]], [[Matthew Botvinick]], [[Oriol Vinyals]], [[Peter W. Battaglia]] ('''2018'''). ''Relational Deep Reinforcement Learning''. [https://arxiv.
    5 KB (564 words) - 16:19, 30 May 2021