Few-Shot Bayesian Imitation Learning with Logical Program Policies

Tom Silver,Alex K Lew,Kelsey R Allen,Leslie Pack Kaelbling,Josh Tenenbaum

doi:10.1609/aaai.v34i06.6587

Abstract

Humans can learn many novel tasks from a very small number (1–5) of demonstrations, in stark contrast to the data requirements of nearly tabula rasa deep learning methods. We propose an expressive class of policies, a strong but general prior, and a learning algorithm that, together, can learn interesting policies from very few examples. We represent policies as logical combinations of programs drawn from a domain-specific language (DSL), define a prior over policies with a probabilistic grammar, and derive an approximate Bayesian inference algorithm to learn policies from demonstrations. In experiments, we study six strategy games played on a 2D grid with one shared DSL. After a few demonstrations of each game, the inferred policies generalize to new game instances that differ substantially from the demonstrations. Our policy learning is 20–1,000x more data efficient than convolutional and fully convolutional policy learning and many orders of magnitude more computationally efficient than vanilla program induction. We argue that the proposed method is an apt choice for tasks that have scarce training data and feature significant, structured variation between task instances.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Apr 3, 2020
Citations: 13	License type: cc-by-nc

R Discovery Prime

R Discovery Prime

Few-Shot Bayesian Imitation Learning with Logical Program Policies

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

Unsupervised learning of probabilistic grammars
Kewei Tu
-
Kewei TuKewei Tu
31 Oct 2012
31 Oct 2012

Learning of Structurally Unambiguous Probabilistic Grammars
Dolav Nitay ... Michal Ziv-Ukelson
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35
Dolav Nitay, et. al.Dolav Nitay ... Michal Ziv-Ukelson
18 May 2021
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35

A study on foraging behavior of swarm robots using reinforcement learning techniques

-

03 Feb 2017
03 Feb 2017

Query-specific learning and inference for probabilistic graphical models
...
-
, et. al. ...
01 Jan 2010
01 Jan 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Few-Shot Bayesian Imitation Learning with Logical Program Policies

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence