Transfer of Temporal Logic Formulas in Reinforcement Learning.

Zhe Xu,Ufuk Topcu

doi:10.24963/ijcai.2019/557

Abstract

Transferring high-level knowledge from a source task to a target task is an effective way to expedite reinforcement learning (RL). For example, propositional logic and first-order logic have been used as representations of such knowledge. We study the transfer of knowledge between tasks in which the timing of the events matters. We call such tasks temporal tasks. We concretize similarity between temporal tasks through a notion of logical transferability, and develop a transfer learning approach between different yet similar temporal tasks. We first propose an inference technique to extract metric interval temporal logic (MITL) formulas in sequential disjunctive normal form from labeled trajectories collected in RL of the two tasks. If logical transferability is identified through this inference, we construct a timed automaton for each sequential conjunctive subformula of the inferred MITL formulas from both tasks. We perform RL on the extended state which includes the locations and clock valuations of the timed automata for the source task. We then establish mappings between the corresponding components (clocks, locations, etc.) of the timed automata from the two tasks, and transfer the extended Q-functions based on the established mappings. Finally, we perform RL on the extended state for the target task, starting with the transferred extended Q-functions. Our implementation results show, depending on how similar the source task and the target task are, that the sampling efficiency for the target task can be improved by up to one order of magnitude by performing RL in the extended state space, and further improved by up to another order of magnitude using the transferred extended Q-functions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Transfer of Temporal Logic Formulas in Reinforcement Learning.

Abstract

Talk to us

Similar Papers

More From: IJCAI : proceedings of the conference

Lead the way for us

Journal: IJCAI : proceedings of the conference	Publication Date: Aug 1, 2019
Citations: 50

Similar Papers

Timed automata for metric interval temporal logic formulae in prototype verification system
Qing-Guo Xu ... Huai-Kou Miao
Journal of Shanghai University (English Edition) | VOL. 12
Qing-Guo Xu, et. al.Qing-Guo Xu ... Huai-Kou Miao
01 Aug 2008
Journal of Shanghai University (English Edition) | VOL. 12

Revisiting MITL to Fix Decision Procedures
Nima Roohi ... Mahesh Viswanathan
-
Nima Roohi, et. al.Nima Roohi ... Mahesh Viswanathan
29 Dec 2017
29 Dec 2017

Robustness of temporal logic specifications for continuous-time signals
Georgios E Fainekos ... George J Pappas
Theoretical Computer Science | VOL. 410
Georgios E Fainekos, et. al.Georgios E Fainekos ... George J Pappas
21 Jun 2009
Theoretical Computer Science | VOL. 410

Temporal Logic Verification for Delay Differential Equations
Peter Nazier Mosaad ... Bai Xue
-
Peter Nazier Mosaad, et. al.Peter Nazier Mosaad ... Bai Xue
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Transfer of Temporal Logic Formulas in Reinforcement Learning.

Abstract

Talk to us

Similar Papers

More From: IJCAI : proceedings of the conference