Induction of Subgoal Automata for Reinforcement Learning

Daniel Furelos-Blanco,Mark Law,Krysia Broda,Anders Jonsson,Alessandra Russo

doi:10.1609/aaai.v34i04.5802

Abstract

In this work we present ISA, a novel approach for learning and exploiting subgoals in reinforcement learning (RL). Our method relies on inducing an automaton whose transitions are subgoals expressed as propositional formulas over a set of observable events. A state-of-the-art inductive logic programming system is used to learn the automaton from observation traces perceived by the RL agent. The reinforcement learning and automaton learning processes are interleaved: a new refined automaton is learned whenever the RL agent generates a trace not recognized by the current automaton. We evaluate ISA in several gridworld problems and show that it performs similarly to a method for which automata are given in advance. We also show that the learned automata can be exploited to speed up convergence through reward shaping and transfer learning across multiple tasks. Finally, we analyze the running time and the number of traces that ISA needs to learn an automata, and the impact that the number of observable events have on the learner's performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Induction of Subgoal Automata for Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Apr 3, 2020
Citations: 24

Similar Papers

Induction and Exploitation of Subgoal Automata for Reinforcement Learning
Daniel Furelos-Blanco ... Mark Law
Journal of Artificial Intelligence Research | VOL. 70
Daniel Furelos-Blanco, et. al.Daniel Furelos-Blanco ... Mark Law
10 Mar 2021
Journal of Artificial Intelligence Research | VOL. 70

Shaping Reward Learning Approach from Passive Samples
Yu Qian ... Zhi-Hua Zhou
Journal of Software | VOL. 24
Yu Qian, et. al.Yu Qian ... Zhi-Hua Zhou
06 Jan 2014
Journal of Software | VOL. 24

FastLAS: Scalable Inductive Logic Programming Incorporating Domain-Specific Optimisation Criteria
Mark Law ... Alessandra Russo
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34
Mark Law, et. al.Mark Law ... Alessandra Russo
03 Apr 2020
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34

Learning relational rule from examples that are neither positive nor negative
Ryutaro Ichise ... Masayuki Numao
Systems and Computers in Japan | VOL. 32
Ryutaro Ichise, et. al.Ryutaro Ichise ... Masayuki Numao
26 Nov 2001
Systems and Computers in Japan | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Induction of Subgoal Automata for Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence