A real-time HIL control system on rotary inverted pendulum hardware platform based on double deep Q-network

Yanyan Dai,Sukgyu Lee,Kidong Lee

doi:10.1177/00202940211000380

Abstract

For real applications, rotary inverted pendulum systems have been known as the basic model in nonlinear control systems. If researchers have no deep understanding of control, it is difficult to control a rotary inverted pendulum platform using classic control engineering models, as shown in section 2.1. Therefore, without classic control theory, this paper controls the platform by training and testing reinforcement learning algorithm. Many recent achievements in reinforcement learning (RL) have become possible, but there is a lack of research to quickly test high-frequency RL algorithms using real hardware environment. In this paper, we propose a real-time Hardware-in-the-loop (HIL) control system to train and test the deep reinforcement learning algorithm from simulation to real hardware implementation. The Double Deep Q-Network (DDQN) with prioritized experience replay reinforcement learning algorithm, without a deep understanding of classical control engineering, is used to implement the agent. For the real experiment, to swing up the rotary inverted pendulum and make the pendulum smoothly move, we define 21 actions to swing up and balance the pendulum. Comparing Deep Q-Network (DQN), the DDQN with prioritized experience replay algorithm removes the overestimate of Q value and decreases the training time. Finally, this paper shows the experiment results with comparisons of classic control theory and different reinforcement learning algorithms.

Highlights

Along with analysis and control from linear systems to nonlinear systems, designing the control system becomes more and more sophisticated
Inverted pendulum systems6 have been known as the basic model for real engineering applications
Without classic control theory, this paper controls the platform by training and testing reinforcement learning algorithm

Summary

Introduction

Along with analysis and control from linear systems to nonlinear systems, designing the control system becomes more and more sophisticated. In Polzounov and Redden a control tool is used to train and test reinforcement learning algorithms on a rotary inverted pendulum platform. In Kim et al. it controls rotary inverted pendulum using deep reinforcement learning rather than classical control engineering. We create a real-time Hardware-in-theloop (HIL) control system to swing up and balance the pendulum using a deep reinforcement learning algorithm rather than classical control engineering. The control system includes four parts: rotary inverted pendulum platform part, HIL interface software part, RL environment part, and agent part. The third section describes how to use double deep Q-network (DDQN) with prioritized experience replay reinforcement learning algorithm to swing up and balance the real rotary inverted pendulum.

Hardware

Conclusion