Theory of mind as inverse reinforcement learning

Julian Jara-Ettinger

doi:10.1016/j.cobeha.2019.04.010

Journal: Current Opinion in Behavioral Sciences
Publication Date: Jun 13, 2019
Citations: 88

Affiliation: Yale University

Keywords: Inverse Reinforcement Learning, Theory of Mind

Abstract

We review the idea that Theory of Mind, our ability to reason about other people's mental states, can be formalized as inverse reinforcement learning (IRL). Under this framework, expectations about how mental states produce behavior are captured in a reinforcement learning (RL) model. Predicting other people's actions is achieved by simulating an RL model with the hypothesized beliefs and desires, while mental-state inference is achieved by inverting this model. Although many advances in IRL did not have human Theory of Mind in mind, here we focus on what they reveal when conceptualized as cognitive theories. We discuss landmark successes of IRL and key challenges in building human-like Theory of Mind.
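
The forward/inverse relationship described in the abstract can be illustrated with a toy sketch. This is not the paper's model: the corridor world, the goal names, the one-step distance utility (a stand-in for a full RL value function), and the rationality parameter `BETA` are all assumptions chosen for illustration. Prediction runs the forward model for a hypothesized goal; inference applies Bayes' rule over goals given observed actions.

```python
import math

# Hypothetical 1-D corridor: positions 0..6, one candidate goal at each end.
N_POS = 7
GOALS = {"left": 0, "right": 6}
ACTIONS = {"L": -1, "R": 1}
BETA = 2.0  # assumed softmax rationality (higher = more deterministic agent)


def step(pos, action):
    """Move one cell, staying inside the corridor."""
    return min(max(pos + ACTIONS[action], 0), N_POS - 1)


def action_probs(pos, goal):
    """Forward model: softmax choice over actions.

    Each action is valued by the negative distance to the goal after taking
    it, a simplified proxy for the value an RL agent would compute.
    """
    utils = {a: -abs(GOALS[goal] - step(pos, a)) for a in ACTIONS}
    z = sum(math.exp(BETA * u) for u in utils.values())
    return {a: math.exp(BETA * u) / z for a, u in utils.items()}


def predict_action(pos, goal):
    """Prediction: most likely action under a hypothesized goal."""
    probs = action_probs(pos, goal)
    return max(probs, key=probs.get)


def infer_goal(start, observed_actions):
    """Inference: invert the forward model with Bayes' rule (uniform prior)."""
    log_post = {g: math.log(1.0 / len(GOALS)) for g in GOALS}
    for g in GOALS:
        pos = start
        for a in observed_actions:
            log_post[g] += math.log(action_probs(pos, g)[a])
            pos = step(pos, a)
    z = sum(math.exp(lp) for lp in log_post.values())
    return {g: math.exp(lp) / z for g, lp in log_post.items()}


# An agent starting in the middle is seen moving right twice.
posterior = infer_goal(start=3, observed_actions=["R", "R"])
```

Under these assumptions, two rightward steps from the middle shift nearly all posterior mass to the "right" goal, and the same `action_probs` function serves both prediction and inference, which is the structural point of the framework.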

Full Text