Domain Adaptation for Imitation Learning Using Generative Adversarial Network.

Tho Nguyen Duc,Eiji Kamioka,Phan Xuan Tan,Chanh Minh Tran

doi:10.3390/s21144718

Tho Nguyen Duc, Eiji Kamioka + Show 2 more

Open Access

https://doi.org/10.3390/s21144718

Copy DOI

Abstract

Imitation learning is an effective approach for an autonomous agent to learn control policies when an explicit reward function is unavailable, using demonstrations provided from an expert. However, standard imitation learning methods assume that the agents and the demonstrations provided by the expert are in the same domain configuration. Such an assumption has made the learned policies difficult to apply in another distinct domain. The problem is formalized as domain adaptive imitation learning, which is the process of learning how to perform a task optimally in a learner domain, given demonstrations of the task in a distinct expert domain. We address the problem by proposing a model based on Generative Adversarial Network. The model aims to learn both domain-shared and domain-specific features and utilizes it to find an optimal policy across domains. The experimental results show the effectiveness of our model in a number of tasks ranging from low to complex high-dimensional.

Highlights

The demand for autonomous agents capable of mimicking human behaviors has grown significantly in recent years
The problem is formalized as domain adaptive imitation learning, which is a process of learning how to perform a task optimally in a learner domain, given demonstrations of the task in a distinct expert domain [14]
The evaluation results of the proposed DAIL-Generative Adversarial Network (GAN) model on lowand high-dimensional tasks are presented to highlight its superior capability in domain adaptive imitation learning

Summary

Introduction

The demand for autonomous agents capable of mimicking human behaviors has grown significantly in recent years. In order for autonomous agents to acquire such human complex behaviors, they are supplied with reward functions indicating the goals of the desired behaviors. Humans can learn complex behaviors from imitation: we observe other experts performing the tasks, infer the tasks, attempt to accomplish the same tasks ourselves. Inspired by this learning procedure, imitation learning has been widely used for training autonomous agents using expert-provided demonstrations [1,2,3,4]

Objectives

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Sensors (Basel, Switzerland)	Publication Date: Jul 9, 2021
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Domain Adaptation for Imitation Learning Using Generative Adversarial Network.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)

Lead the way for us

Similar Papers

Multiclass Land Cover Mapping from Historical Orthophotos Using Domain Adaptation and Spatio-Temporal Transfer Learning
Wouter A J Van Den Broeck ... Maarten Loopmans
Remote sensing | VOL. 14
Wouter A J Van Den Broeck, et. al.Wouter A J Van Den Broeck ... Maarten Loopmans
22 Nov 2022
Remote sensing | VOL. 14

Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation
Yuxuan Liu ... Pieter Abbeel
-
Yuxuan Liu, et. al.Yuxuan Liu ... Pieter Abbeel
01 May 2018
01 May 2018

ACGAIL: Imitation Learning About Multiple Intentions with Auxiliary Classifier GANs
Jiahao Lin ... Zongzhang Zhang
-
Jiahao Lin, et. al.Jiahao Lin ... Zongzhang Zhang
01 Jan 2018
01 Jan 2018

Face Recognition via Domain Adaptation and Manifold Distance Metric Learning
Bo Li ... Ping-Ping Zheng
-
Bo Li, et. al.Bo Li ... Ping-Ping Zheng
01 Jan 2017
01 Jan 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Domain Adaptation for Imitation Learning Using Generative Adversarial Network.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)