Abstract

Training agents to learn efficiently in multi-agent environments can benefit from explicitly modelling other agents' beliefs, especially in complex limited-information games such as the Hanabi card game. However, generalization is also highly relevant to performance in these games, and model comparisons over long training timescales can be difficult. In this work, we address this by introducing a novel model trained using a sleep metaphor on a reduced-complexity version of the Hanabi game. This sleep metaphor consists of an altered training regimen as well as an information-theoretic constraint on the agent's policy. Experimental results demonstrate improved performance from this sleep-metaphor method, and provide promising motivation for applying similar techniques in more complex methods that incorporate explicit models of other agents' beliefs.
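The abstract does not specify the exact form of the information-theoretic constraint, so the sketch below is only an illustration of the general idea: a policy-gradient loss augmented with an entropy regularizer, a common way to bound the information a policy commits to. The function name `policy_loss_with_info_constraint` and the coefficient `beta` are hypothetical; the paper's actual constraint and training regimen may differ.

```python
import torch
import torch.nn.functional as F

def policy_loss_with_info_constraint(logits, actions, advantages, beta=0.01):
    """Policy-gradient loss with an information-theoretic penalty.

    Assumption: the paper's constraint is illustrated here as a standard
    entropy regularizer on the policy; the actual formulation is not
    given in the abstract.
    """
    log_probs = F.log_softmax(logits, dim=-1)             # log pi(a | s)
    probs = log_probs.exp()
    chosen = log_probs.gather(-1, actions.unsqueeze(-1)).squeeze(-1)

    pg_loss = -(chosen * advantages).mean()               # REINFORCE-style term
    entropy = -(probs * log_probs).sum(dim=-1).mean()     # H[pi(. | s)]

    # Subtracting beta * entropy penalizes low-entropy (overly committed)
    # policies, acting as a soft information-theoretic constraint.
    return pg_loss - beta * entropy

# Minimal usage example with random data:
logits = torch.randn(8, 5)                 # batch of 8 states, 5 actions
actions = torch.randint(0, 5, (8,))        # sampled actions
advantages = torch.randn(8)                # estimated advantages
loss = policy_loss_with_info_constraint(logits, actions, advantages)
loss.backward()
```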
