Abstract

For real applications, rotary inverted pendulum systems have been known as the basic model in nonlinear control systems. If researchers have no deep understanding of control, it is difficult to control a rotary inverted pendulum platform using classic control engineering models, as shown in section 2.1. Therefore, without classic control theory, this paper controls the platform by training and testing reinforcement learning algorithm. Many recent achievements in reinforcement learning (RL) have become possible, but there is a lack of research to quickly test high-frequency RL algorithms using real hardware environment. In this paper, we propose a real-time Hardware-in-the-loop (HIL) control system to train and test the deep reinforcement learning algorithm from simulation to real hardware implementation. The Double Deep Q-Network (DDQN) with prioritized experience replay reinforcement learning algorithm, without a deep understanding of classical control engineering, is used to implement the agent. For the real experiment, to swing up the rotary inverted pendulum and make the pendulum smoothly move, we define 21 actions to swing up and balance the pendulum. Comparing Deep Q-Network (DQN), the DDQN with prioritized experience replay algorithm removes the overestimate of Q value and decreases the training time. Finally, this paper shows the experiment results with comparisons of classic control theory and different reinforcement learning algorithms.

Highlights

  • Along with analysis and control from linear systems to nonlinear systems, designing the control system becomes more and more sophisticated

  • Inverted pendulum systems6 have been known as the basic model for real engineering applications

  • Without classic control theory, this paper controls the platform by training and testing reinforcement learning algorithm

Read more

Summary

Introduction

Along with analysis and control from linear systems to nonlinear systems, designing the control system becomes more and more sophisticated. In Polzounov and Redden a control tool is used to train and test reinforcement learning algorithms on a rotary inverted pendulum platform. In Kim et al. it controls rotary inverted pendulum using deep reinforcement learning rather than classical control engineering. We create a real-time Hardware-in-theloop (HIL) control system to swing up and balance the pendulum using a deep reinforcement learning algorithm rather than classical control engineering. The control system includes four parts: rotary inverted pendulum platform part, HIL interface software part, RL environment part, and agent part. The third section describes how to use double deep Q-network (DDQN) with prioritized experience replay reinforcement learning algorithm to swing up and balance the real rotary inverted pendulum.

Hardware
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call