Deep deterministic policy gradient with constraints for gait optimisation of biped robots

Xingyang Liu,Peng Yue,Gexiang Zhang,Ferrante Neri,Haina Rong

doi:10.3233/ica-230724

Abstract

In this paper, we propose a novel Reinforcement Learning (RL) algorithm for robotic motion control, that is, a constrained Deep Deterministic Policy Gradient (DDPG) deviation learning strategy to assist biped robots in walking safely and accurately. The previous research on this topic highlighted the limitations in the controller’s ability to accurately track foot placement on discrete terrains and the lack of consideration for safety concerns. In this study, we address these challenges by focusing on ensuring the overall system’s safety. To begin with, we tackle the inverse kinematics problem by introducing constraints to the damping least squares method. This enhancement not only addresses singularity issues but also guarantees safe ranges for joint angles, thus ensuring the stability and reliability of the system. Based on this, we propose the adoption of the constrained DDPG method to correct controller deviations. In constrained DDPG, we incorporate a constraint layer into the Actor network, incorporating joint deviations as state inputs. By conducting offline training within the range of safe angles, it serves as a deviation corrector. Lastly, we validate the effectiveness of our proposed approach by conducting dynamic simulations using the CRANE biped robot. Through comprehensive assessments, including singularity analysis, constraint effectiveness evaluation, and walking experiments on discrete terrains, we demonstrate the superiority and practicality of our approach in enhancing walking performance while ensuring safety. Overall, our research contributes to the advancement of biped robot locomotion by addressing gait optimisation from multiple perspectives, including singularity handling, safety constraints, and deviation learning.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep deterministic policy gradient with constraints for gait optimisation of biped robots

Abstract

Talk to us

Similar Papers

More From: Integrated Computer-Aided Engineering

Lead the way for us

Journal: Integrated Computer-Aided Engineering	Publication Date: Jan 30, 2024
Citations: 1

Similar Papers

Safe Reinforcement Learning Benchmark Environments for Aerospace Control Systems
Umberto J Ravaioli ... Kerianne L Hobbs
-
Umberto J Ravaioli, et. al.Umberto J Ravaioli ... Kerianne L Hobbs
05 Mar 2022
05 Mar 2022

A parallel heterogeneous policy deep reinforcement learning algorithm for bipedal walking motion design.
Chunguang Li ... Chongben Tao
Frontiers in Neurorobotics | VOL. 17
Chunguang Li, et. al.Chunguang Li ... Chongben Tao
08 Aug 2023
Frontiers in Neurorobotics | VOL. 17

Biped dynamic walking using reinforcement learning
Hamid Benbrahim ... Judy A Franklin
Robotics and Autonomous Systems | VOL. 22
Hamid Benbrahim, et. al.Hamid Benbrahim ... Judy A Franklin
01 Dec 1997
Robotics and Autonomous Systems | VOL. 22

Dynamic Economic Optimization of a Continuously Stirred Tank Reactor Using Reinforcement Learning
Derek Machalek ... Titus Quah
-
Derek Machalek, et. al.Derek Machalek ... Titus Quah
01 Jul 2020
01 Jul 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep deterministic policy gradient with constraints for gait optimisation of biped robots

Abstract

Talk to us

Similar Papers

More From: Integrated Computer-Aided Engineering