Abstract

Driving in a dynamic, multi-agent, and complex urban environment is a difficult task requiring a complex decision-making policy. Learning such a policy requires a state representation that can encode the entire environment. Mid-level representations that encode a vehicle’s environment as images have become a popular choice, but they are quite high-dimensional, limiting their use in data-hungry approaches such as reinforcement learning. In this article, we propose to learn a low-dimensional yet rich latent representation of the environment by leveraging knowledge of relevant semantic factors. To do this, we train an encoder-decoder deep neural network to predict multiple application-relevant factors, such as the trajectories of other agents and of the ego car. Furthermore, we propose a hazard signal, based on other vehicles’ future trajectories and the planned route, which is used in conjunction with the learned latent representation as input to a downstream policy. We demonstrate that the multi-head encoder-decoder neural network yields a more informative representation than a standard single-head model. In particular, the proposed representation learning and the hazard signal help reinforcement learning to learn faster, with increased performance and less data than baseline methods.
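
To make the multi-head encoder-decoder idea concrete, the sketch below shows one shared convolutional encoder feeding several decoder heads: image reconstruction plus ego and other-agent trajectory prediction. This is a minimal sketch under assumed settings (64x64 bird's-eye-view input, 64-d latent, illustrative head names and sizes); it is not the authors' exact architecture.

```python
import torch
import torch.nn as nn

class MultiHeadVAE(nn.Module):
    """Minimal multi-head VAE sketch: a shared encoder and several decoder
    heads, each predicting an application-relevant factor. Head names and
    dimensions are illustrative assumptions, not the paper's architecture."""

    def __init__(self, in_channels=3, latent_dim=64):
        super().__init__()
        # Convolutional encoder mapping a 64x64 bird's-eye-view image to latent stats.
        self.encoder = nn.Sequential(
            nn.Conv2d(in_channels, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 4, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, 128, 4, stride=2, padding=1), nn.ReLU(),
            nn.Flatten(),
        )
        feat_dim = 128 * 8 * 8  # 64x64 input halved three times -> 8x8
        self.fc_mu = nn.Linear(feat_dim, latent_dim)
        self.fc_logvar = nn.Linear(feat_dim, latent_dim)

        # Auxiliary heads: input reconstruction plus trajectory predictions.
        def head(out_dim):
            return nn.Sequential(nn.Linear(latent_dim, 256), nn.ReLU(),
                                 nn.Linear(256, out_dim))
        self.recon_head = head(in_channels * 64 * 64)  # reconstruct the input image
        self.ego_traj_head = head(10 * 2)              # 10 future ego (x, y) points
        self.agents_traj_head = head(5 * 10 * 2)       # 5 agents x 10 points each

    def forward(self, x):
        h = self.encoder(x)
        mu, logvar = self.fc_mu(h), self.fc_logvar(h)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)  # reparameterization
        return {
            "recon": self.recon_head(z),
            "ego_traj": self.ego_traj_head(z),
            "agents_traj": self.agents_traj_head(z),
            "mu": mu, "logvar": logvar,
        }

# Quick shape check.
out = MultiHeadVAE()(torch.randn(2, 3, 64, 64))
print(out["ego_traj"].shape)  # torch.Size([2, 20])
```

Training such a model would sum per-head prediction losses with the usual KL regularization term, as in a standard VAE objective.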

Highlights

  • Driving in unstructured and dynamic urban environments is an arduous task

  • Even mid-level representations may be so high-dimensional that their use with data-hungry methods such as reinforcement learning (RL) is limited

  • The primary contributions of this work are: (a) a multitask network with auxiliary heads that improves the quality of low-dimensional representations, (b) a hazard signal computed from the likelihood between the planned route and the predicted trajectories of dynamic agents (a minimal sketch of this signal follows this list), and (c) an experimental study of RL policy learning showing that the latent vector learned with auxiliary tasks, together with the hazard signal, helps the policy to (i) train faster, (ii) perform better, (iii) solve the task with less data, and (iv) generalize better to new scenarios
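
One plausible formulation of the hazard signal is sketched below: score how likely any predicted agent position falls on the ego vehicle's planned route. The Gaussian proximity kernel, the worst-case (max) aggregation, and the function name `hazard_signal` are illustrative assumptions, not the paper's exact computation.

```python
import numpy as np

def hazard_signal(route, agent_trajs, sigma=1.0):
    """Hypothetical hazard score in [0, 1].

    route:       (R, 2) array of planned route waypoints (x, y) in metres.
    agent_trajs: (A, T, 2) array of predicted positions for A agents over
                 T future time steps.
    sigma:       spatial scale (m) of the Gaussian proximity kernel.
    """
    # Pairwise offsets between every predicted position and every route point.
    diffs = agent_trajs[:, :, None, :] - route[None, None, :, :]        # (A, T, R, 2)
    dists = np.linalg.norm(diffs, axis=-1)                              # (A, T, R)
    # Gaussian likelihood of each prediction w.r.t. its nearest route point.
    likelihood = np.exp(-dists.min(axis=-1) ** 2 / (2.0 * sigma ** 2))  # (A, T)
    # Hazard = worst case over agents and time steps.
    return float(likelihood.max())

# Example: one agent predicted to cross the route, one far away from it.
route = np.stack([np.linspace(0, 20, 21), np.zeros(21)], axis=1)
crossing = np.stack([np.full(5, 10.0), np.linspace(4, -4, 5)], axis=1)
distant = np.stack([np.linspace(0, 4, 5), np.full(5, 30.0)], axis=1)
print(hazard_signal(route, np.stack([crossing, distant])))  # close to 1.0
```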

Summary

INTRODUCTION

Driving in unstructured and dynamic urban environments is an arduous task. Many moving agents such as cars, bicycles, and pedestrians affect the driver’s behavior and decisions. The primary contributions of this work are: (a) a multitask network with auxiliary heads that improves the quality of low-dimensional representations, (b) a hazard signal computed from the likelihood between the planned route and the predicted trajectories of dynamic agents, and (c) an experimental study of RL policy learning showing that the latent vector learned with auxiliary tasks, together with the hazard signal, helps the policy to (i) train faster, (ii) perform better, (iii) solve the task with less data, and (iv) generalize better to new scenarios. The sketch below illustrates how these two inputs can feed the downstream policy.
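
As an illustration of how the learned latent vector and the hazard signal can drive the policy, here is a minimal DQN sketch: a Q-network over the concatenation [z; hazard] plus a single temporal-difference update. The layer sizes, three-action space, and replay-batch layout are assumptions for illustration, not the paper's implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Hypothetical dimensions: 64-d latent plus a scalar hazard, 3 discrete actions.
LATENT_DIM, N_ACTIONS = 64, 3

class QNet(nn.Module):
    """Small MLP mapping [latent z ; hazard] to Q-values (illustrative sizes)."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(LATENT_DIM + 1, 128), nn.ReLU(),
            nn.Linear(128, 128), nn.ReLU(),
            nn.Linear(128, N_ACTIONS),
        )
    def forward(self, z, hazard):
        return self.net(torch.cat([z, hazard], dim=-1))

q_net, target_net = QNet(), QNet()
target_net.load_state_dict(q_net.state_dict())
opt = torch.optim.Adam(q_net.parameters(), lr=1e-4)

def td_update(batch, gamma=0.99):
    """One DQN step on a replay batch (z, hazard, action, reward, z2, hazard2, done)."""
    z, h, a, r, z2, h2, done = batch
    q = q_net(z, h).gather(1, a.unsqueeze(1)).squeeze(1)  # Q(s, a) taken in the batch
    with torch.no_grad():  # bootstrapped target from the frozen target network
        target = r + gamma * (1 - done) * target_net(z2, h2).max(dim=1).values
    loss = F.smooth_l1_loss(q, target)
    opt.zero_grad(); loss.backward(); opt.step()
    return loss.item()
```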

RELATED WORK
Variational Auto-Encoder
Reinforcement Learning
Deep Q-Network
METHOD
Learning Latent Representation Using Multi-Head VAE
Generation of Hazard Signal
Policy Learning Using DQN
EXPERIMENTS
Simulation Environment and Data Collection
Implementation Details
Effect of Different Heads and the Hazard Signal
Comparison to Baselines
Effect of the Dataset Size
Qualitative Analysis
Generalization Analysis
Findings
CONCLUSIONS