Rémi Munos

6,225 bytes added, 22:26, 3 June 2018

[[FILE:remimunus.jpg|border|right|thumb|link=http://www.cmap.polytechnique.fr/~munos/| Rémi Munos <ref>[http://www.cmap.polytechnique.fr/~munos/ Remi Munos Home page]</ref> ]]

'''Rémi Munos''',<br/>
a French mathematician and computer scientist at [[Google]] [[DeepMind]], from 2000 to 2006 Associate Professor at the Centre de Mathématiques Appliquées, [https://en.wikipedia.org/wiki/%C3%89cole_Polytechnique École Polytechnique], and later affiliated with [https://en.wikipedia.org/wiki/National_Institute_for_Research_in_Computer_Science_and_Control INRIA] [https://en.wikipedia.org/wiki/Lille Lille] <ref>[http://www.inria.fr/lille/home/view?set_language=en home — INRIA Lille - Nord Europe]</ref>. His research interests cover [[Reinforcement Learning|reinforcement learning]], [https://en.wikipedia.org/wiki/Multi-armed_bandit multi-armed bandits], and [[Dynamic Programming|dynamic programming]]. Rémi Munos was a contributor to the [[Go]]-playing program [https://www.game-ai-forum.org/icga-tournaments/program.php?id=515 Mogo], which applies [[Monte-Carlo Tree Search]] with patterns in the simulations and improvements to [[UCT]].

=Selected Publications=
<ref>[http://researchers.lille.inria.fr/~munos/papers/bib_remi.html Publications Remi Munos]</ref> <ref>[http://dblp.uni-trier.de/pers/hd/m/Munos:R=eacute=mi dblp: Rémi Munos]</ref>
==1996==
* [[Rémi Munos]] ('''1996'''). ''A convergent reinforcement learning algorithm in the continuous case: the finite-element reinforcement learning''. International Conference on Machine Learning, Morgan Kaufmann
==2005 ...==
* [[Sylvain Gelly]], [[Yizao Wang]], [[Rémi Munos]], [[Olivier Teytaud]] ('''2006'''). ''Modification of UCT with Patterns in Monte-Carlo Go''. [http://hal.inria.fr/inria-00117266 INRIA]
* [[Jean-Yves Audibert]], [[Rémi Munos]], [[Csaba Szepesvári]] ('''2007'''). ''Tuning Bandit Algorithms in Stochastic Environments''. [http://certis.enpc.fr/~audibert/ucb_alt.pdf pdf]
* [[Yizao Wang]], [[Jean-Yves Audibert]], [[Rémi Munos]] ('''2008'''). ''Algorithms for Infinitely Many-Armed Bandits''. Advances in Neural Information Processing Systems, [http://www.stat.lsa.umich.edu/%7Eyizwang/publications/wang08algorithms.pdf pdf], Supplemental material - [http://www.stat.lsa.umich.edu/%7Eyizwang/publications/wang08algorithmsSupp.pdf pdf]
* [[Rémi Munos]], [[Csaba Szepesvári]] ('''2008'''). ''Finite time bounds for sampling based fitted value iteration''. Journal of Machine Learning Research, 9:815-857, 2008. [http://hal.inria.fr/docs/00/26/09/34/PDF/savi_1.5.pdf pdf], [http://www.ualberta.ca/~szepesva/papers/munos08a.pdf pdf]
* [http://www.informatik.uni-trier.de/~ley/db/indices/a-tree/m/Ma=icirc=trepierre:Rapha=euml=l.html Raphaël Maîtrepierre], [[Jérémie Mary]], [[Rémi Munos]] ('''2008'''). ''[http://books.google.com/books?id=3Wsn3SwEBScC&pg=PA458&lpg=PA458&dq=Rapha%EBl+Ma%EEtrepierre&source=bl&ots=a_vvlH9zqQ&sig=QQqkAN4diLmIK9DOxuigUF7cQrY&hl=en&ei=WVG7TP2lHcGQswaYptmsDQ&sa=X&oi=book_result&ct=result&resnum=2&ved=0CCAQ6AEwAQ#v=onepage&q&f=false Adaptive play in Texas Hold'em Poker]''. [http://www.informatik.uni-trier.de/~ley/db/conf/ecai/ecai2008.html ECAI 2008]
* [[Jean-Yves Audibert]], [[Rémi Munos]], [[Csaba Szepesvári]] ('''2009'''). ''Exploration-exploitation trade-off using variance estimates in multi-armed bandits''. Theoretical Computer Science, 410:1876-1902, 2009, [http://www.ualberta.ca/~szepesva/papers/ucbtuned-journal.pdf pdf]
* [[Vincent Berthier]], [[Amine Bourki]], [[Matthieu Coulm]], [[Guillaume Chaslot]], [[Christophe Fiter]], [[Sylvain Gelly]], [[Jean-Baptiste Hoock]], [[Rémi Munos]], [[Julien Pérez]], [[Arpad Rimmel]], [[Philippe Rolet]], [[Olivier Teytaud]], [[Paul Vayssière]], [[Yizao Wang]], [[Ziqin Yu]] (et al.) ('''2009'''). ''Computer-Go is not only for Go''. Korea, August 2009 [http://www.lri.fr/~teytaud/korea.pdf slides as pdf]
==2010 ...==
* [[Rémi Munos]] ('''2010'''). ''Approximate dynamic programming''. In [http://www.isir.upmc.fr/?op=view_profil&id=28&old=N&lang=en Olivier Sigaud] and [http://www.loria.fr/~buffet/ Olivier Buffet], editors, [http://eu.wiley.com/WileyCDA/WileyTitle/productCd-1848211678.html Markov Decision Processes in Artificial Intelligence], chapter 3, pages 67-98. ISTE Ltd and [http://eu.wiley.com/WileyCDA/ John Wiley & Sons Inc.], [http://researchers.lille.inria.fr/~munos/papers/files/MDPIA_chap3.pdf pdf]
* [[Mathematician#ROrtner|Ronald Ortner]], [[Mathematician#DRyabko|Daniil Ryabko]], [[Peter Auer]], [[Rémi Munos]] ('''2014'''). ''Regret bounds for restless Markov bandits''. [https://en.wikipedia.org/wiki/Theoretical_Computer_Science_%28journal%29 Theoretical Computer Science] 558, [http://daniil.ryabko.net/mabajr.pdf pdf]
* [[Rémi Munos]] ('''2014'''). ''From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning''. [http://dblp.uni-trier.de/db/journals/ftml/ftml7.html#Munos14 Foundations and Trends in Machine Learning, Vol. 7, No 1], [https://hal.archives-ouvertes.fr/hal-00747575 hal-00747575v5], [http://chercheurs.lille.inria.fr/~munos/papers/files/AAAI2013_slides.pdf slides as pdf]
==2015 ...==
* [[Audrūnas Gruslys]], [[Rémi Munos]], [[Ivo Danihelka]], [[Marc Lanctot]], [[Alex Graves]] ('''2016'''). ''Memory-Efficient Backpropagation Through Time''. [https://arxiv.org/abs/1606.03401v1 arXiv:1606.03401]
* [[Jane X Wang]], [[Zeb Kurth-Nelson]], [[Dhruva Tirumala]], [[Hubert Soyer]], [[Joel Z Leibo]], [[Rémi Munos]], [[Charles Blundell]], [[Dharshan Kumaran]], [[Matt Botvinick]] ('''2016'''). ''Learning to reinforcement learn''. [https://arxiv.org/abs/1611.05763 arXiv:1611.05763]
* [[Arthur Guez]], [[Théophane Weber]], [[Ioannis Antonoglou]], [[Karen Simonyan]], [[Oriol Vinyals]], [[Daan Wierstra]], [[Rémi Munos]], [[David Silver]] ('''2018'''). ''Learning to Search with MCTSnets''. [https://arxiv.org/abs/1802.04697 arXiv:1802.04697]

=External Links=
* [http://researchers.lille.inria.fr/%7Emunos/ Remi Munos Homepage]
* [https://scholar.google.com/citations?user=OvKEnVwAAAAJ&hl=en Rémi Munos - Google Scholar Citations]

=References=
<references />


GerdIsenberg
