**Stuart J. Russell**, a British computer scientist, professor and chair of Computer Science, Lotfi A. Zadeh chair in Engineering, and Director, Center for Intelligent Systems, Computer Science Division, University of California, Berkeley, California.
He received his B.A. with first-class honors in physics from Oxford University in 1982, and his Ph.D. in Computer Science from Stanford in 1986. He is also an adjunct professor of Neurological Surgery at UC San Francisco.
He is a Fellow and former Executive Council member of the AAAI and a Fellow of the ACM. His research covers a wide range of topics in artificial intelligence, including machine learning, probabilistic reasoning, knowledge representation, planning, real-time decision making, multitarget tracking, and computer vision ^{[2]}.

# Optimal Game-Tree Search

Abstract from Stuart Russell, Eric Wefald (**1989**). *On optimal game-tree search using rational metareasoning* ^{[3]}:

In this paper we outline a general approach to the study of problem-solving, in which search steps are considered decisions in the same sense as actions in the world. Unlike other metrics in the literature, the value of a search step is defined as a real utility rather than as a quasi-utility, and can therefore be computed directly from a model of the base-level problem-solver. We develop a formula for the expected value of a search step in a game-playing context using the single-step assumption, namely that a computation step can be evaluated as it was the last to be taken. We prove some meta-level theorems that enable the development of a low-overhead algorithm, MGSS*, that chooses search steps in order of highest estimated utility. Although we show that the single-step assumption is untenable in general, a program implemented for the game of Othello soundly beats an alpha-beta search while expanding significantly fewer nodes, even though both programs use the same evaluation function.
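
The single-step idea in the abstract can be illustrated with a small sketch. This is not the paper's formula and not MGSS*; it is a generic myopic value-of-computation calculation under assumptions made here for illustration: the function name `myopic_value_of_computation`, the discrete `outcomes` distribution over a move's revised estimate, and the `cost` parameter are all hypothetical. It treats one further computation as if it were the last one before moving, and asks how much better the chosen move is expected to be afterwards.

```python
from typing import Dict, List, Tuple


def myopic_value_of_computation(
    estimates: Dict[str, float],
    move: str,
    outcomes: List[Tuple[float, float]],
    cost: float = 0.0,
) -> float:
    """Expected net gain from one computation that revises `move`'s estimate.

    estimates: current utility estimate for each root move (assumed given).
    outcomes:  (probability, revised estimate) pairs for `move` (assumed model).
    cost:      assumed cost of the computation, in the same utility units.
    """
    # Move we would play right now, without any further computation.
    current_best = max(estimates, key=estimates.get)
    gain = 0.0
    for prob, revised in outcomes:
        updated = {**estimates, move: revised}
        new_best = max(updated, key=updated.get)
        # Judge both choices under the revised estimates: the computation only
        # helps in outcomes where it changes which move looks best.
        gain += prob * (updated[new_best] - updated[current_best])
    return gain - cost


if __name__ == "__main__":
    root = {"a": 0.5, "b": 0.4}
    # Searching under "b" may raise it above "a" or lower it further.
    print(myopic_value_of_computation(root, "b", [(0.5, 0.7), (0.5, 0.1)], cost=0.02))
```

A selection rule in this spirit would compute such a value for each candidate search step, take the one with the highest estimate, and stop searching once no step has positive net value.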

# Selected Publications

## 1986 ...

- Stuart Russell (**1986**). *Preliminary Steps Toward the Automation of Induction*. AAAI-86
- Stuart Russell (**1986**). *Quantitative Analysis of Analogy*. AAAI-86
- Stuart Russell, Eric Wefald (**1988**). *Decision-Theoretic Search Control: General Theory and an Application to Game-Playing*. CS Technical Report 88/435, University of California, Berkeley
- Stuart Russell, Eric Wefald (**1988**). *Multi-Level Decision-Theoretic Search*. AAAI Symposium on Computer Game-Playing, Stanford
- Stuart Russell, Eric Wefald (**1989**). *On optimal game-tree search using rational metareasoning*. IJCAI 1989
- Eric Wefald, Stuart Russell (**1989**). *Adaptive Learning of Decision-Theoretic Search Control Knowledge*. 6th International Workshop on Machine Learning

## 1990 ...

- Stuart Russell (**1990, 2013**). *Fine-Grained Decision-Theoretic Search Control*. arXiv:1304.1133
- Stuart Russell, Eric Wefald (**1991**). *Principles of Metareasoning*. Artificial Intelligence, Vol. 49, Nos. 1-3
- Stuart Russell, Eric Wefald (**1991**). *Do the right thing: studies in limited rationality*. MIT Press
- Stuart Russell (**1992**). *Efficient Memory-Bounded Search Methods*. 10th European Conference on Artificial Intelligence, Vienna
- Stuart Russell, Peter Norvig (**1994**). *A Modern, Agent-Oriented Approach to AI Instruction*. AAAI Fall Symposium on Innovative Instruction for Introductory AI, New Orleans
- Stuart Russell (**1996**). *Machine Learning*. Chapter 4 of M. A. Boden (Ed.), Artificial Intelligence, Academic Press. Part of the Handbook of Perception and Cognition
- Nir Friedman, Moises Goldszmidt, David Heckerman, Stuart Russell (**1997**). *Where is the Impact of Bayesian Networks in Learning?* IJCAI 1997
- Ronald Parr, Stuart Russell (**1997**). *Reinforcement Learning with Hierarchies of Machines*. In Advances in Neural Information Processing Systems 10, MIT Press
- Vassilis Papavassiliou, Stuart Russell (**1999**). *Convergence of reinforcement learning with general function approximators*. IJCAI-99

## 2000 ...

- Andrew Y. Ng, Stuart Russell (**2000**). *Algorithms for inverse reinforcement learning*. 17th International Conference on Machine Learning, Stanford
- Stuart Russell (**2003**). *Rationality and Intelligence*. In Renée Elio (ed.) *Common sense, reasoning, and rationality*. Oxford University Press
- Judea Pearl, Stuart Russell (**2003**). *Bayesian Networks*.
- Stuart Russell, Peter Norvig (**2003**). *Artificial Intelligence: A Modern Approach*. 2nd edition
- Bhaskara Marthi, David Latham, Carlos Guestrin, Stuart Russell (**2005**). *Concurrent Hierarchical Reinforcement Learning*. IJCAI 2005
- Bhaskara Marthi, Stuart Russell, David Latham (**2005**). *Writing Stratagus-Playing Agents in Concurrent ALisp*. IJCAI-05
- Stuart Russell, Jason Wolfe (**2005**). *Efficient belief-state AND–OR search, with application to Kriegspiel*. IJCAI 2005 » KriegSpiel
- Jason Wolfe, Stuart Russell (**2007**). *Exploiting Belief State Structure in Graph Search*. ICAPS 2007

## 2010 ...

- Stuart Russell, Peter Norvig (**2010**). *Artificial Intelligence: A Modern Approach*. 3rd edition
- Jason Wolfe, Stuart Russell (**2011**). *Bounded Intention Planning*. IJCAI 2011
- Stuart Russell (**2014**). *Unifying Logic and Probability: A New Dawn for AI?* IPMU 2014
- Stuart Russell (**2015**). *Unifying logic and probability*. Communications of the ACM, Vol. 58, No. 7
- David A. Moore, Stuart Russell (**2015**). *Gaussian Process Random Fields*. arXiv:1511.00054
- Dylan Hadfield-Menell, Anca Dragan, Pieter Abbeel, Stuart Russell (**2016**). *The Off-Switch Game*. arXiv:1611.08219
- Tongzhou Wang, Yi Wu, David A. Moore, Stuart Russell (**2017**). *Neural Block Sampling*. arXiv:1708.06040
- Dylan Hadfield-Menell, Smitha Milli, Pieter Abbeel, Stuart Russell, Anca Dragan (**2017**). *Inverse Reward Design*. arXiv:1711.02827
- Dhruv Malik, Malayandi Palaniappan, Jaime F. Fisac, Dylan Hadfield-Menell, Stuart Russell, Anca Dragan (**2018**). *An Efficient, Generalized Bellman Update For Cooperative Inverse Reinforcement Learning*. arXiv:1806.03820

