MURM: Utilization of Multi-Views for Goal-Conditioned Reinforcement Learning in Robotic Manipulation

Seongwon Jang,Hyemi Jeong,Hyunseok Yang

doi:10.3390/robotics12040119

Abstract

We present a novel framework, multi-view unified reinforcement learning for robotic manipulation (MURM), which efficiently utilizes multiple camera views to train a goal-conditioned policy for a robot to perform complex tasks. The MURM framework consists of three main phases: (i) demo collection from an expert, (ii) representation learning, and (iii) offline reinforcement learning. In the demo collection phase, we design a scripted expert policy that uses privileged information, such as Cartesian coordinates of a target and goal, to solve the tasks. We add noise to the expert policy to provide sufficient interactive information about the environment, as well as suboptimal behavioral trajectories. We designed three tasks in a Pybullet simulation environment, including placing an object in a desired goal position and picking up various objects that are randomly positioned in the environment. In the representation learning phase, we use a vector-quantized variational autoencoder (VQVAE) to learn a more structured latent representation that makes it feasible to train for RL compared to high-dimensional raw images. We train VQVAE models for each distinct camera view and define the best viewpoint settings for training. In the offline reinforcement learning phase, we use the Implicit Q-learning (IQL) algorithm as our baseline and introduce a separated Q-functions method and dropout method that can be implemented in multi-view settings to train the goal-conditioned policy with supervised goal images. We conduct experiments in simulation and show that the single-view baseline fails to solve complex tasks, whereas MURM is successful.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

MURM: Utilization of Multi-Views for Goal-Conditioned Reinforcement Learning in Robotic Manipulation

Abstract

Talk to us

Similar Papers

More From: Robotics

Lead the way for us

Journal: Robotics	Publication Date: Aug 19, 2023
License type: CC BY 4.0

Similar Papers

What can classic Atari video games tell us about the human brain?
Raphael Köster ... Martin J Chadwick
Neuron | VOL. 109
Raphael Köster, et. al.Raphael Köster ... Martin J Chadwick
01 Feb 2021
Neuron | VOL. 109

A Spectrum Handoff Method Based on Reinforcement and Transfer Learning
Jiaxing Zhao ... Fuchang Li
-
Jiaxing Zhao, et. al.Jiaxing Zhao ... Fuchang Li
01 Aug 2020
01 Aug 2020

Collaborative Unsupervised Multi-View Representation Learning
Qinghai Zheng ... Zhongyu Li
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 32
Qinghai Zheng, et. al.Qinghai Zheng ... Zhongyu Li
01 Jul 2022
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 32

SURRL: Structural Unsupervised Representations for Robot Learning
Fengyi Zhang ... Zhiyong Liu
IEEE Transactions on Cognitive and Developmental Systems | VOL. 15
Fengyi Zhang, et. al.Fengyi Zhang ... Zhiyong Liu
01 Jun 2023
IEEE Transactions on Cognitive and Developmental Systems | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MURM: Utilization of Multi-Views for Goal-Conditioned Reinforcement Learning in Robotic Manipulation

Abstract

Talk to us

Similar Papers

More From: Robotics