Changes

Tor Lattimore

368 bytes added, 22:40, 17 January 2019
* [[Tor Lattimore]], [[Marcus Hutter]] ('''2012'''). ''PAC Bounds for Discounted MDPs''. [http://www.informatik.uni-trier.de/~ley/db/conf/alt/alt2012.htm Algorithmic Learning Theory], [https://en.wikipedia.org/wiki/Lecture_Notes_in_Computer_Science Lecture Notes in Computer Science], [https://en.wikipedia.org/wiki/Springer-Verlag Springer] <ref>[https://en.wikipedia.org/wiki/Markov_decision_process Markov decision process from Wikipedia]</ref>
* [[Tor Lattimore]], [[Marcus Hutter]] ('''2014'''). ''[https://link.springer.com/chapter/10.1007/978-3-319-11662-4_13 Bayesian Reinforcement Learning with Exploration]''. [http://dblp.uni-trier.de/db/conf/alt/alt2014.html Algorithmic Learning Theory], [https://en.wikipedia.org/wiki/Lecture_Notes_in_Computer_Science Lecture Notes in Computer Science] 8776, [https://en.wikipedia.org/wiki/Springer_Science%2BBusiness_Media Springer]
* [[Tor Lattimore]], [[Rémi Munos]] ('''2014'''). ''Bounded Regret for Finite-Armed Structured Bandits''. [https://arxiv.org/abs/1411.2919 arXiv:1411.2919]
==2015 ...==
* [[Tor Lattimore]] ('''2015'''). ''Optimally Confident UCB: Improved Regret for Finite-Armed Bandits''. [https://arxiv.org/abs/1507.07880 arXiv:1507.07880]
* [[Tom Everitt]], [[Tor Lattimore]], [[Marcus Hutter]] ('''2016'''). ''Free Lunch for Optimisation under the Universal Distribution''. [https://arxiv.org/abs/1608.04544 arXiv:1608.04544]
* [[Tor Lattimore]], [[Csaba Szepesvári]] ('''2017'''). ''The End of Optimism? An Asymptotic Analysis of Finite-Armed Linear Bandits''. [https://www.aistats.org/aistats2017/ AISTATS], [https://sites.ualberta.ca/~szepesva/papers/linbandits_aistats17.pdf pdf], [https://arxiv.org/abs/1610.04491 arXiv:1610.04491] (2016)
* [[Joel Veness]], [[Tor Lattimore]], [https://github.com/avishkar58 Avishkar Bhoopchand], [https://scholar.google.co.uk/citations?user=mB4yebIAAAAJ&hl=en Agnieszka Grabska-Barwinska], [https://dblp.org/pers/hd/m/Mattern:Christopher Christopher Mattern], [https://dblp.org/pers/hd/t/Toth:Peter Peter Toth] ('''2017'''). ''Online Learning with Gated Linear Networks''. [https://arxiv.org/abs/1712.01897 arXiv:1712.01897]
* [[Tor Lattimore]], [[Csaba Szepesvári]] ('''2018'''). ''Cleaning up the neighborhood: A full classification for adversarial partial monitoring''. [https://arxiv.org/abs/1805.09247 arXiv:1805.09247]
* [[Tor Lattimore]], [[Csaba Szepesvári]] ('''2019'''). ''Bandit Algorithms''. Cambridge University Press (draft), [http://downloads.tor-lattimore.com/banditbook/book.pdf pdf]
* [https://www.stmintz.com/ccc/index.php?id=373656 pawn hash] by [[Tor Lattimore]], [[CCC]], July 03, 2004 » [[Pawn Hash Table]]
* [https://www.stmintz.com/ccc/index.php?id=381595 MTD Drivers] by [[Tor Lattimore]], [[CCC]], August 10, 2004 » [[MTD(f)]]
* [https://www.stmintz.com/ccc/index.php?id=381931 Verified Null-moving] by [[Tor Lattimore]], [[CCC]], August 12, 2004 » [[Null Move Pruning#ZugzwangVerification|Verified Null Move Pruning]]
* [https://www.stmintz.com/ccc/index.php?id=385027 Qsearch Checks] by [[Tor Lattimore]], [[CCC]], August 29, 2004 » [[Quiescence Search]], [[Check]]
* [https://www.stmintz.com/ccc/index.php?id=420049 Parsing enormous.pgn] by [[Tor Lattimore|Tor Alexander Lattimore]], [[CCC]], April 08, 2005 » [[Portable Game Notation]]
=External Links=
