Improving cognitive agent decision making: Experience trajectories as plans

Jens Pfau,Samin Karim,Liz Sonenberg,Michael Kirley

doi:10.3233/wia-140296

Abstract

In task environments with large state and action spaces, the use of temporal and state abstraction can potentially improve the decision making performance of agents. However, existing approaches within a reinforcement learning framework typically identify possible subgoal states and instantly learn stochastic subpolicies to reach them from other states. In these circumstances, exploration of the reinforcement learner is unfavorably biased towards local behavior around these subgoals; temporal abstractions are not exploited to reduce required deliberation; and the benefit of employing temporal abstractions is conflated with the benefit of additional learning done to define subpolicies. In this paper, we consider a cognitive agent architecture that allows for the extraction and reuse of temporal abstractions in the form of experience trajectories from a bottom-level reinforcement learning module and a top-level module based on the BDI (Belief-Desire-Intention) model. Here, the reuse of trajectories depends on the situation in which their recording was started. We investigate the efficacy of our approach using two well-known domains – the pursuit and the taxi domains. Detailed simulation experiments demonstrate that the use of experience trajectories as plans acquired at runtime can reduce the amount of decision making without significantly affecting asymptotic performance. The combination of temporal and state abstraction leads to improved performance during the initial learning of the reinforcement learner. Our approach can significantly reduce the number of deliberations required.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improving cognitive agent decision making: Experience trajectories as plans

Abstract

Talk to us

Similar Papers

More From: Web Intelligence and Agent Systems: An International Journal

Lead the way for us

Similar Papers

Hierarchical reinforcement learning for transportation infrastructure maintenance planning
Zachary Hamida ... James-A Goulet
Reliability Engineering & System Safety | VOL. 235
Zachary Hamida, et. al.Zachary Hamida ... James-A Goulet
08 Mar 2023
Reliability Engineering & System Safety | VOL. 235

Improving the Performance of Batch-Constrained Reinforcement Learning in Continuous Action Domains via Generative Adversarial Networks
Baturay Saglam ... Suleyman S Kozat
-
Baturay Saglam, et. al.Baturay Saglam ... Suleyman S Kozat
15 May 2022
15 May 2022

Cooperative modular reinforcement learning for large discrete action space problem
Fangzhu Ming ... Chengmei Zhao
Neural Networks | VOL. 161
Fangzhu Ming, et. al.Fangzhu Ming ... Chengmei Zhao
02 Feb 2023
Neural Networks | VOL. 161

Generalising Discrete Action Spaces with Conditional Action Trees
Christopher Bamford ... Alvaro Ovalle
-
Christopher Bamford, et. al.Christopher Bamford ... Alvaro Ovalle
17 Aug 2021
17 Aug 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving cognitive agent decision making: Experience trajectories as plans

Abstract

Talk to us

Similar Papers

More From: Web Intelligence and Agent Systems: An International Journal