Changes

Stuart Russell

No change in size, 22:37, 15 May 2020

no edit summary

* [https://dblp.uni-trier.de/pers/hd/h/Hadfield=Menell:Dylan Dylan Hadfield-Menell], [https://dblp.uni-trier.de/pers/hd/d/Dragan:Anca_D= Anca Dragan], [https://dblp.uni-trier.de/pers/hd/a/Abbeel:Pieter Pieter Abbeel], [[Stuart Russell]] ('''2016'''). ''The Off-Switch Game''. [https://arxiv.org/abs/1611.08219 arXiv:1611.08219]

* [https://dblp.uni-trier.de/pers/hd/w/Wang:Tongzhou Tongzhou Wan], [https://dblp.uni-trier.de/pers/hd/w/Wu:Yi Yi Wu], [https://dblp.uni-trier.de/pers/hd/m/Moore:David_A= David A. Moore], [[Stuart Russell]] ('''2017'''). ''Neural Block Sampling''. [https://arxiv.org/abs/1708.06040 arXiv:1708.06040]

* [https://dblp.uni-trier.de/pers/hd/h/Hadfield=Menell:Dylan Dylan Hadfield-Menell], [https://dblp.uni-trier.de/pers/hd/m/Milli:Smitha Smitha Milli], [https://dblp.uni-trier.de/pers/hd/a/Abbeel:Pieter Pieter Abbeel], [[Stuart Russell]], [https://dblp.uni-trier.de/pers/hd/d/Dragan:Anca_D= Anca Dragan] ('''2017'''). ''Inverse Reward Design**''. [https://arxiv.org/abs/1711.02827 arXiv:1711.02827]

* [https://dblp.uni-trier.de/pers/hd/m/Malik:Dhruv Dhruv Malik], [https://dblp.uni-trier.de/pers/hd/p/Palaniappan:Malayandi Malayandi Palaniappan], [https://dblp.uni-trier.de/pers/hd/f/Fisac:Jaime_F= Jaime F. Fisac], [https://dblp.uni-trier.de/pers/hd/h/Hadfield=Menell:Dylan Dylan Hadfield-Menell], [[Stuart Russell]], [https://dblp.uni-trier.de/pers/hd/d/Dragan:Anca_D= Anca Dragan] ('''2018'''). ''An Efficient, Generalized Bellman Update For Cooperative Inverse Reinforcement Learning''. [https://arxiv.org/abs/1806.03820 arXiv:1806.03820]

GerdIsenberg

Bureaucrats, Administrators

25,161

edits

Changes

Stuart Russell

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools