Analysis of Agent Expertise in Ms. Pac-Man Using Value-of-Information-Based Policies

Isaac John Sledge,Jose C Principe

doi:10.1109/tg.2018.2808201

Abstract

Conventional reinforcement learning methods for Markov decision processes rely on weakly-guided, stochastic searches to drive the learning process. It can therefore be difficult to predict what agent behaviors might emerge. In this paper, we consider an information-theoretic cost function for performing constrained stochastic searches that promote the formation of risk-averse to risk-favoring behaviors. This cost function is the value of information, which provides the optimal trade-off between the expected return of a policy and the policy's complexity; policy complexity is measured by number of bits and controlled by a single hyperparameter on the cost function. As the policy complexity is reduced, the agents will increasingly eschew risky actions. This reduces the potential for high accrued rewards. As the policy complexity increases, the agents will take actions, regardless of the risk, that can raise the long-term rewards. The obtainable reward depends on a single, tunable hyperparameter that regulates the degree of policy complexity. We evaluate the performance of value-of-information-based policies on a stochastic version of Ms. Pac-Man. A major component of this paper is the demonstration that ranges of policy complexity values yield different game-play styles and explaining why this occurs. We also show that our reinforcement-learning search mechanism is more efficient than the others we utilize. This result implies that the value of information theory is appropriate for framing the exploitation-exploration trade-off in reinforcement learning.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Analysis of Agent Expertise in Ms. Pac-Man Using Value-of-Information-Based Policies

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Games

Lead the way for us

Journal: IEEE Transactions on Games	Publication Date: Jun 1, 2019
Citations: 12

Similar Papers

A new approach for sample size calculation in cost-effectiveness studies based on value of information
Clément Bader ... Antoine Bénard
BMC Medical Research Methodology | VOL. 18
Clément Bader, et. al.Clément Bader ... Antoine Bénard
22 Oct 2018
BMC Medical Research Methodology | VOL. 18

Value of new performance information in healthcare: evidence from Japan.
Susanna Gallani ... Ranjani Krishnan
International Journal of Health Economics and Management | VOL. 20
Susanna Gallani, et. al.Susanna Gallani ... Ranjani Krishnan
18 Aug 2020
International Journal of Health Economics and Management | VOL. 20

Task-oriented information value measurement based on space-time prisms
Yingjie Hu ... Yuqi Chen
International Journal of Geographical Information Science | VOL. 30
Yingjie Hu, et. al.Yingjie Hu ... Yuqi Chen
17 Dec 2015
International Journal of Geographical Information Science | VOL. 30

Prioritizing Disaster Mapping Tasks for Online Volunteers Based on Information Value Theory
Yingjie Hu ... Helen Couclelis
Geographical Analysis | VOL. 49
Yingjie Hu, et. al.Yingjie Hu ... Helen Couclelis
13 Oct 2016
Geographical Analysis | VOL. 49

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Analysis of Agent Expertise in Ms. Pac-Man Using Value-of-Information-Based Policies

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Games