Changes

Jump to: navigation, search

Stuart Russell

No change in size, 22:37, 15 May 2020
no edit summary
* [https://dblp.uni-trier.de/pers/hd/h/Hadfield=Menell:Dylan Dylan Hadfield-Menell], [https://dblp.uni-trier.de/pers/hd/d/Dragan:Anca_D= Anca Dragan], [https://dblp.uni-trier.de/pers/hd/a/Abbeel:Pieter Pieter Abbeel], [[Stuart Russell]] ('''2016'''). ''The Off-Switch Game''. [https://arxiv.org/abs/1611.08219 arXiv:1611.08219]
* [https://dblp.uni-trier.de/pers/hd/w/Wang:Tongzhou Tongzhou Wan], [https://dblp.uni-trier.de/pers/hd/w/Wu:Yi Yi Wu], [https://dblp.uni-trier.de/pers/hd/m/Moore:David_A= David A. Moore], [[Stuart Russell]] ('''2017'''). ''Neural Block Sampling''. [https://arxiv.org/abs/1708.06040 arXiv:1708.06040]
* [https://dblp.uni-trier.de/pers/hd/h/Hadfield=Menell:Dylan Dylan Hadfield-Menell], [https://dblp.uni-trier.de/pers/hd/m/Milli:Smitha Smitha Milli], [https://dblp.uni-trier.de/pers/hd/a/Abbeel:Pieter Pieter Abbeel], [[Stuart Russell]], [https://dblp.uni-trier.de/pers/hd/d/Dragan:Anca_D= Anca Dragan] ('''2017'''). ''Inverse Reward Design**''. [https://arxiv.org/abs/1711.02827 arXiv:1711.02827]
* [https://dblp.uni-trier.de/pers/hd/m/Malik:Dhruv Dhruv Malik], [https://dblp.uni-trier.de/pers/hd/p/Palaniappan:Malayandi Malayandi Palaniappan], [https://dblp.uni-trier.de/pers/hd/f/Fisac:Jaime_F= Jaime F. Fisac], [https://dblp.uni-trier.de/pers/hd/h/Hadfield=Menell:Dylan Dylan Hadfield-Menell], [[Stuart Russell]], [https://dblp.uni-trier.de/pers/hd/d/Dragan:Anca_D= Anca Dragan] ('''2018'''). ''An Efficient, Generalized Bellman Update For Cooperative Inverse Reinforcement Learning''. [https://arxiv.org/abs/1806.03820 arXiv:1806.03820]

Navigation menu