LORM: a novel reinforcement learning framework for biped gait control.

Weiyi Zhang,Yancao Jiang,Fasih Ud Din Farrukh,Chun Zhang,Debing Zhang,Guangqi Wang

doi:10.7717/peerj-cs.927

Weiyi Zhang, Yancao Jiang + Show 4 more

Open Access

https://doi.org/10.7717/peerj-cs.927

Copy DOI

Export

Save

Cite

Journal: PeerJ. Computer science	Publication Date: Mar 28, 2022
Citations: 5	License type: CC BY 4.0

Abstract
Full-Text
Similar Papers

Abstract

Listen

Legged robots are better able to adapt to different terrains compared with wheeled robots. However, traditional motion controllers suffer from extremely complex dynamics properties. Reinforcement learning (RL) helps to overcome the complications of dynamics design and calculation. In addition, the high autonomy of the RL controller results in a more robust response to complex environments and terrains compared with traditional controllers. However, RL algorithms are limited by the problems of convergence and training efficiency due to the complexity of the task. Learn and outperform the reference motion (LORM), an RL based framework for gait controlling of biped robot is proposed leveraging the prior knowledge of reference motion. The proposed trained agent outperformed the reference motion and existing motion-based methods. The RL environment was finely crafted for optimal performance, including the pruning of state space and action space, reward shaping, and design of episode criterion. Several improvements were implemented to further improve the training efficiency and performance including: random state initialization (RSI), the noise of joint angles, and a novel improvement based on symmetrization of gait. To validate the proposed method, the Darwin-op robot was set as the target platform and two different tasks were designed: (I) Walking as fast as possible and (II) Tracking specific velocity. In task (I), the proposed method resulted in the walking velocity of 0.488 m/s, with a 5.8 times improvement compared with the original traditional reference controller. The directional accuracy improved by 87.3%. The velocity performance achieved 2× compared with the rated max velocity and more than 8× compared with other recent works. To our knowledge, our work achieved the best velocity performance on the platform Darwin-op. In task (II), the proposed method achieved a tracking accuracy of over 95%. Different environments are introduced including plains, slopes, uneven terrains, and walking with external force, where the robot was expected to maintain walking stability with ideal speed and little direction deviation, to validate the performance and robustness of the proposed method.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

LORM: a novel reinforcement learning framework for biped gait control.

Abstract

Published Version

Talk to us

Similar Papers

More From: PeerJ. Computer science

Lead the way for us

Similar Papers

Dynamic Economic Optimization of a Continuously Stirred Tank Reactor Using Reinforcement Learning
Derek Machalek ... Titus Quah
-
Derek Machalek, et. al.Derek Machalek ... Titus Quah
01 Jul 2020
01 Jul 2020

Motion Sequence Learning for Robot Walking Based on Pose optimization
Yancao Jiang ... Chun Zhang
-
Yancao Jiang, et. al.Yancao Jiang ... Chun Zhang
13 Oct 2020
13 Oct 2020

Learning to Locomote: Understanding How Environment Design Matters for Deep Reinforcement Learning
Daniele Reda ... Tianxin Tao
-
Daniele Reda, et. al.Daniele Reda ... Tianxin Tao
16 Oct 2020
16 Oct 2020

Towards Generalization and Efficiency in Reinforcement Learning

-

02 Jul 2019
02 Jul 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

LORM: a novel reinforcement learning framework for biped gait control.

Abstract

Published Version

Talk to us

Similar Papers

More From: PeerJ. Computer science