Changes

← Older edit

Csaba Szepesvári

252 bytes added, 23:16, 12 April 2021

no edit summary

* [[István Szita]], [[Csaba Szepesvári]] ('''2011'''). ''Agnostic KWIK learning and efficient approximate reinforcement learning''. [http://www.informatik.uni-trier.de/~ley/db/journals/jmlr/jmlrp19.html#SzitaS11 Journal of Machine Learning Research - Proceedings Track 19]

* [[Sylvain Gelly]], [[Marc Schoenauer]], [[Michèle Sebag]], [[Olivier Teytaud]], [[Levente Kocsis]], [[David Silver]], [[Csaba Szepesvári]] ('''2012'''). ''[http://dl.acm.org/citation.cfm?id=2093548.2093574 The Grand Challenge of Computer Go: Monte Carlo Tree Search and Extensions]''. [[ACM#Communications|Communications of the ACM]], Vol. 55, No. 3, [http://www0.cs.ucl.ac.uk/staff/D.Silver/web/Applications_files/grand-challenge.pdf pdf preprint]

* [https://scholar.google.com/citations?user=iuCvTuIAAAAJ&hl=en Mahdi Milani Fard], [[Joelle Pineau]], [[Csaba Szepesvári]] ('''2012'''). ''PAC-Bayesian Policy Evaluation for Reinforcement Learning''. [https://arxiv.org/abs/1202.3717 arXiv:1202.3717]

==2015 ...==

* [[Tor Lattimore]], [[Csaba Szepesvári]] ('''2017'''). ''The End of Optimism? An Asymptotic Analysis of Finite-Armed Linear Bandits''. [https://www.aistats.org/aistats2017/ AISTATS], [https://sites.ualberta.ca/~szepesva/papers/linbandits_aistats17.pdf pdf]

GerdIsenberg

Bureaucrats, Administrators

25,161

edits

Changes

Csaba Szepesvári

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools