Marc Lanctot

5,848 bytes added, 23:02, 3 June 2018

'''[[Main Page|Home]] * [[People]] * Marc Lanctot'''

[[FILE:marc-oct-2012.jpg|border|right|thumb|link=http://mlanctot.info/| Marc Lanctot <ref>[http://mlanctot.info/ Marc Lanctot's Web Page]</ref> ]]

'''Marc Lanctot''',<br/>
a Canadian computer scientist at [[Google]] [[DeepMind]] involved in the [[AlphaZero]] project, and previously a post-doctoral researcher in the [[Maastricht University]] Games and AI Group <ref>[https://project.dke.maastrichtuniversity.nl/games/ Games and AI Group]</ref> of [[Mark Winands]]. He received an M.Sc. from [[McGill University]] in 2005 <ref>[[Marc Lanctot]] ('''2005'''). ''Adaptive Virtual Environments in Multi-player Computer Games''. MSc. Thesis, [[McGill University]]</ref>, and a Ph.D. from the [[University of Alberta]] in 2013 <ref>[[Marc Lanctot]] ('''2013'''). ''Monte Carlo Sampling and Regret Minimization for Equilibrium Computation and Decision-Making in Large Extensive Form Games''. Ph.D. thesis, [[University of Alberta]], advisor [[Michael Bowling]]</ref>. Marc is generally interested in [[Artificial Intelligence|AI]], [[Learning|machine learning]], and [[Games|games]]. His current research focuses on [https://en.wikipedia.org/wiki/Sampling_%28statistics%29 sampling] algorithms for [https://en.wikipedia.org/wiki/Computable_general_equilibrium equilibrium computation] and [https://en.wikipedia.org/wiki/Decision-making decision-making], as well as variants of [[Monte-Carlo Tree Search]].

=Selected Publications=
<ref>[http://www.informatik.uni-trier.de/~ley/pers/hd/l/Lanctot:Marc.html dblp: Marc Lanctot]</ref>
==2005 ...==
* [[Marc Lanctot]] ('''2005'''). ''Adaptive Virtual Environments in Multi-player Computer Games''. MSc. Thesis, [[McGill University]]
* [[Frantisek Sailer]], [[Michael Buro]], [[Marc Lanctot]] ('''2007'''). ''Adversarial planning through strategy simulation''. [[IEEE#CIG|Computational Intelligence and Games]], [https://skatgame.net/mburo/ps/rtsmc.pdf pdf]
==2010 ...==
* [[Joel Veness]], [[Marc Lanctot]], [[Michael Bowling]] ('''2011'''). ''Variance Reduction in Monte-Carlo Tree Search''. [http://papers.nips.cc/book/advances-in-neural-information-processing-systems-24-2011 NIPS], [http://papers.nips.cc/paper/4288-variance-reduction-in-monte-carlo-tree-search.pdf pdf]
* [[Marc Lanctot]], [[Abdallah Saffidine]], [[Joel Veness]], [[Christopher Archibald]] ('''2012'''). ''Sparse Sampling for Adversarial Games''. [[ECAI CGW 2012]]
* [[Marc Lanctot]] ('''2013'''). ''Monte Carlo Sampling and Regret Minimization for Equilibrium Computation and Decision-Making in Large Extensive Form Games''. Ph.D. thesis, [[University of Alberta]], advisor [[Michael Bowling]]
* [[Marc Lanctot]], [[Abdallah Saffidine]], [[Joel Veness]], [[Christopher Archibald]], [[Mark Winands]] ('''2013'''). ''Monte Carlo *-Minimax Search''. [[Conferences#IJCAI|IJCAI 2013]]
* [[Markus Esser]], [[Michael Gras]], [[Mark Winands]], [[Maarten Schadd]], [[Marc Lanctot]] ('''2013'''). ''Improving Best-Reply Search''. [[CG 2013]]
* [[Marc Lanctot]] ('''2013'''). ''LOA Wins Lines of Action Tournament''. [[ICGA Journal#36_4|ICGA Journal, Vol. 36, No. 4]] » [[17th Computer Olympiad#LOA|17th Computer Olympiad]]
* [[Marc Lanctot]] ('''2013'''). ''SIA Wins Surakarta Tournament''. [[ICGA Journal#36_4|ICGA Journal, Vol. 36, No. 4]] » [[17th Computer Olympiad#Surakarta|17th Computer Olympiad]]
* [[Tom Pepels]], [[Tristan Cazenave]], [[Mark Winands]], [[Marc Lanctot]] ('''2014'''). ''Minimizing Simple and Cumulative Regret in Monte-Carlo Tree Search''. [[ECAI CGW 2014]]
==2015 ...==
* [[Johannes Heinrich]], [[Marc Lanctot]], [[David Silver]] ('''2015'''). ''Fictitious Self-Play in Extensive-Form Games''. [http://proceedings.mlr.press/v37/ JMLR: W&CP, Vol. 37], [http://proceedings.mlr.press/v37/heinrich15.pdf pdf]
* [[Ziyu Wang]], [[Nando de Freitas]], [[Marc Lanctot]] ('''2016'''). ''Dueling Network Architectures for Deep Reinforcement Learning''. [http://arxiv.org/abs/1511.06581 arXiv:1511.06581]
* [[David Silver]], [[Shih-Chieh Huang|Aja Huang]], [[Chris J. Maddison]], [[Arthur Guez]], [[Laurent Sifre]], [[George van den Driessche]], [[Julian Schrittwieser]], [[Ioannis Antonoglou]], [[Veda Panneershelvam]], [[Marc Lanctot]], [[Sander Dieleman]], [[Dominik Grewe]], [[John Nham]], [[Nal Kalchbrenner]], [[Ilya Sutskever]], [[Timothy Lillicrap]], [[Madeleine Leach]], [[Koray Kavukcuoglu]], [[Thore Graepel]], [[Demis Hassabis]] ('''2016'''). ''[http://www.nature.com/nature/journal/v529/n7587/full/nature16961.html Mastering the game of Go with deep neural networks and tree search]''. [https://en.wikipedia.org/wiki/Nature_%28journal%29 Nature], Vol. 529 » [[AlphaGo]]
* [[Audrūnas Gruslys]], [[Rémi Munos]], [[Ivo Danihelka]], [[Marc Lanctot]], [[Alex Graves]] ('''2016'''). ''Memory-Efficient Backpropagation Through Time''. [https://arxiv.org/abs/1606.03401v1 arXiv:1606.03401]
* [[Marc Lanctot]], [[Vinícius Flores Zambaldi]], [[Audrūnas Gruslys]], [[Angeliki Lazaridou]], [[Karl Tuyls]], [[Julien Pérolat]], [[David Silver]], [[Thore Graepel]] ('''2017'''). ''A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning''. [https://arxiv.org/abs/1711.00832 arXiv:1711.00832]
* [[David Silver]], [[Thomas Hubert]], [[Julian Schrittwieser]], [[Ioannis Antonoglou]], [[Matthew Lai]], [[Arthur Guez]], [[Marc Lanctot]], [[Laurent Sifre]], [[Dharshan Kumaran]], [[Thore Graepel]], [[Timothy Lillicrap]], [[Karen Simonyan]], [[Demis Hassabis]] ('''2017'''). ''Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm''. [https://arxiv.org/abs/1712.01815 arXiv:1712.01815] » [[AlphaZero]]

=External Links=
* [http://mlanctot.info/ Marc Lanctot's Web Page]
* [http://scholar.google.com/citations?user=E_oZZj8AAAAJ&hl=en Marc Lanctot - Google Scholar Citations]

=References=
<references />

'''[[People|Up one level]]'''
