Reinforcement Learning With Evolutionary Trajectory Generator: A General Approach for Quadrupedal Locomotion

Haojie Shi, Jiangyong Li, Kang Wang, Yueqiang Dong, Bo Zhou, Max Q.-H. Meng, Hao Tian, Fan Wang, Hongsheng Zeng

doi:10.1109/lra.2022.3145495

Abstract

Recently, reinforcement learning (RL) has emerged as a promising approach for quadrupedal locomotion, which can save the manual effort that conventional approaches require, such as designing skill-specific controllers. However, due to the complex nonlinear dynamics of quadrupedal robots and reward sparsity, it is still difficult for RL to learn effective gaits from scratch, especially in challenging tasks such as walking over a balance beam. To alleviate this difficulty, we propose a novel RL-based approach that contains an evolutionary foot trajectory generator. Unlike prior methods that use a fixed trajectory generator, the generator continually optimizes the shape of the output trajectory for the given task, providing diversified motion priors to guide policy learning. The policy is trained with reinforcement learning to output residual control signals that fit different gaits. We then optimize the trajectory generator and the policy network alternately to stabilize training, and share the exploratory data between them to improve sample efficiency. As a result, our approach can solve a range of challenging tasks in simulation by learning from scratch, including walking on a balance beam and crawling through a cave. To further verify the effectiveness of our approach, we deploy the controller learned in simulation on a 12-DoF quadrupedal robot, and it successfully traverses challenging scenarios with efficient gaits. A video showing the learned gaits in different tasks is available on YouTube (youtube.com/watch?v=hgBLR09MEOw), and the code is available on GitHub (github.com/PaddlePaddle/PaddleRobotics).
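The interplay the abstract describes (a parameterized foot trajectory, a residual from the policy added on top, and an evolutionary search over the trajectory shape) can be illustrated with a minimal sketch. This is not the authors' implementation: the two-parameter trajectory, the function names (`foot_trajectory`, `foot_target`, `evolve`), the (1+λ)-style search, and the toy fitness are all assumptions chosen for illustration. In the actual method the fitness would be an episode return from simulation, and the policy would be updated by an RL algorithm between evolution steps.

```python
import math
import random


def foot_trajectory(phase, step_length, step_height):
    """Hypothetical open-loop foot path in the sagittal plane.

    Swing for phase in [0, 0.5): foot moves forward along a sine arc.
    Stance for phase in [0.5, 1): foot moves backward on the ground.
    """
    phase = phase % 1.0
    if phase < 0.5:
        s = phase / 0.5                      # swing progress in [0, 1)
        x = step_length * (s - 0.5)
        z = step_height * math.sin(math.pi * s)
    else:
        s = (phase - 0.5) / 0.5              # stance progress in [0, 1)
        x = step_length * (0.5 - s)
        z = 0.0
    return x, z


def foot_target(phase, params, residual):
    """Generator output plus the policy's residual correction (assumed additive)."""
    x, z = foot_trajectory(phase, *params)
    return x + residual[0], z + residual[1]


def evolve(params, fitness, rng, sigma=0.01, population=16):
    """One step of a simple (1+lambda) search: keep the best of parent and children."""
    best, best_fit = params, fitness(params)
    for _ in range(population):
        child = tuple(p + rng.gauss(0.0, sigma) for p in params)
        child_fit = fitness(child)
        if child_fit > best_fit:
            best, best_fit = child, child_fit
    return best


def toy_fitness(params):
    """Stand-in for an episode return; peaks at an arbitrary (0.12, 0.04)."""
    step_length, step_height = params
    return -((step_length - 0.12) ** 2) - (step_height - 0.04) ** 2


rng = random.Random(0)
params = (0.05, 0.02)                        # initial (step_length, step_height)
for _ in range(20):
    # With the policy held fixed, improve the trajectory shape.
    params = evolve(params, toy_fitness, rng)
    # With the generator held fixed, an RL update of the policy would go here.
```

Because the parent is always kept when no child improves on it, the fitness of `params` never decreases across iterations in this sketch; the alternating schedule is what the abstract credits with stabilizing training.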

Journal: IEEE Robotics and Automation Letters
Publication Date: Apr 1, 2022
Citations: 26

Similar Papers

A Hierarchical Framework for Quadruped Robots Gait Planning Based on DDPG.
Yanbiao Li ... Chentao Wu
Biomimetics | VOL. 8
22 Aug 2023

Automated Hyperparameter Tuning in Reinforcement Learning for Quadrupedal Robot Locomotion
Myeongseop Kim ... Jae-Han Park
Electronics | VOL. 13
27 Dec 2023

Posture Correction of Quadruped Robot for Adaptive Slope Walking
Chenxiao Yu ... Yangsheng Xu
01 Dec 2018

Multi-Phase Joint-Angle Trajectory Generation Inspired by Dog Motion for Control of Quadruped Robot.
Jungsu Choi
Sensors (Basel, Switzerland) | VOL. 21
24 Sep 2021
