Balance Controller Design for Inverted Pendulum Considering Detail Reward Function and Two-Phase Learning Protocol

Xiaochen Liu,Xingxing Li,Sipeng Wang,Ze Cui

doi:10.3390/sym16091227

Abstract

As a complex nonlinear system, the inverted pendulum (IP) system has the characteristics of asymmetry and instability. In this paper, the IP system is controlled by a learned deep neural network (DNN) that directly maps the system states to control commands in an end-to-end style. On the basis of deep reinforcement learning (DRL), the detail reward function (DRF) is designed to guide the DNN learning control strategy, which greatly enhances the pertinence and flexibility of the control. Moreover, a two-phase learning protocol (offline learning phase and online learning phase) is proposed to solve the “real gap” problem of the IP system. Firstly, the DNN learns the offline control strategy based on a simplified IP dynamic model and DRF. Then, a security controller is designed and used on the IP platform to optimize the DNN online. The experimental results demonstrate that the DNN has good robustness to model errors after secondary learning on the platform. When the length of the pendulum is reduced by 25% or increased by 25%, the steady-state error of the pendulum angle is less than 0.05 rad. The error is within the allowable range. The DNN is robust to changes in the length of the pendulum. The DRF and the two-phase learning protocol improve the adaptability of the controller to the complex and variable characteristics of the real platform and provide reference for other learning-based robot control problems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Balance Controller Design for Inverted Pendulum Considering Detail Reward Function and Two-Phase Learning Protocol

Abstract

Talk to us

Similar Papers

More From: Symmetry

Lead the way for us

Journal: Symmetry	Publication Date: Sep 18, 2024
License type: CC BY 4.0

Similar Papers

Transparency and Explanation in Deep Reinforcement Learning Neural Networks
Rahul Iyer ... Yuezhang Li
-
Rahul Iyer, et. al.Rahul Iyer ... Yuezhang Li
27 Dec 2018
27 Dec 2018

Artificial Intelligence and the Common Sense of Animals.
Murray Shanahan ... Benjamin Beyret
Trends in Cognitive Sciences | VOL. 24
Murray Shanahan, et. al.Murray Shanahan ... Benjamin Beyret
08 Oct 2020
Trends in Cognitive Sciences | VOL. 24

Verbal Explanations for Deep Reinforcement Learning Neural Networks with Attention on Extracted Features
Xinzhi Wang ... Shengcheng Yuan
-
Xinzhi Wang, et. al.Xinzhi Wang ... Shengcheng Yuan
01 Oct 2019
01 Oct 2019

Break through the limits of learning by machines
Zhongzhi Shi
Chinese Science Bulletin | VOL. 61
Zhongzhi ShiZhongzhi Shi
20 Sep 2016
Chinese Science Bulletin | VOL. 61

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Balance Controller Design for Inverted Pendulum Considering Detail Reward Function and Two-Phase Learning Protocol

Abstract

Talk to us

Similar Papers

More From: Symmetry