Active Grammatical Inference for Non-Markovian Planning

Noah Topper,George Atia,Alvaro Velasquez,Ashutosh Trivedi

doi:10.1609/icaps.v32i1.19853

Abstract

Planning in finite stochastic environments is canonically posed as a Markov decision process where the transition and reward structures are explicitly known. Reinforcement learning (RL) lifts the explicitness assumption by working with sampling models instead. Further, with the advent of reward machines, we can relax the Markovian assumption on the reward. Angluin's active grammatical inference algorithm L* has found novel application in explicating reward machines for non-Markovian RL. We propose maintaining the assumption of explicit transition dynamics, but with an implicit non-Markovian reward signal, which must be inferred from experiments. We call this setting non-Markovian planning, as opposed to non-Markovian RL. The proposed approach leverages L* to explicate an automaton structure for the underlying planning objective. We exploit the environment model to learn an automaton faster and integrate it with value iteration to accelerate the planning. We compare against recent non-Markovian RL solutions which leverage grammatical inference, and establish complexity results that illustrate the difference in runtime between grammatical inference in planning and RL settings.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Active Grammatical Inference for Non-Markovian Planning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the International Conference on Automated Planning and Scheduling

Lead the way for us

Similar Papers

An Extension of Finite-state Markov Decision Process and an Application of Grammatical Inference
Takeshi Shibata ... Ryo Yoshinak
-
Takeshi Shibata, et. al.Takeshi Shibata ... Ryo Yoshinak
01 Jan 2008
01 Jan 2008

ML-based Reinforcement Learning Approach for Power Management in SoCs
David Akselrod
-
David AkselrodDavid Akselrod
01 Sep 2019
01 Sep 2019

Active Inference: Demystified and Compared.
Noor Sajid ... Karl J Friston
Neural Computation | VOL. 33
Noor Sajid, et. al.Noor Sajid ... Karl J Friston
05 Jan 2021
Neural Computation | VOL. 33

Reinforcement Learning for Clinical Applications.
Kia Khezeli ... Benjamin Shickel
Clinical journal of the American Society of Nephrology : CJASN | VOL. 18
Kia Khezeli, et. al.Kia Khezeli ... Benjamin Shickel
08 Feb 2023
Clinical journal of the American Society of Nephrology : CJASN | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Active Grammatical Inference for Non-Markovian Planning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the International Conference on Automated Planning and Scheduling