Abstract

In this contribution, we develop a feedback controller in the form of a parametric function for a mobile inverted pendulum. The control both stabilizes the system and drives it to target positions with target orientations. A design of the controller based only on a cost function is difficult for this task, which is why we choose to train the controller using imitation learning on optimized trajectories. In contrast to popular approaches like policy gradient methods, this approach allows us to shape the behavior of the system by including equality constraints. When transferring the parametric controller from simulation to the real mobile inverted pendulum, the control performance is degraded due to the reality gap. A robust control design can reduce the degradation. However, for the framework of imitation learning on optimized trajectories, methods that explicitly consider robustness do not yet exist to the knowledge of the authors. We tackle this research gap by presenting a method to design a robust controller in the form of a recurrent neural network, to improve the transferability of the trained controller to the real system. As a last step, we make the behavior of the parametric controller adjustable to allow for the fine tuning of the behavior of the real system. We design the controller for our system and show in the application that the recurrent neural network has increased performance compared to a static neural network without robustness considerations.

Highlights

  • The control of mobile and unstable systems is, in most cases, divided into a stabilizing and a maneuvering part, e.g., [1]

  • Methods based on optimization and learning come into play that train a control law in the form of a parametric function based on a cost function

  • The cost function c( xt, ut ) is the same that was used for the trajectory optimization

Read more

Summary

Introduction

The control of mobile and unstable systems is, in most cases, divided into a stabilizing and a maneuvering part, e.g., [1]. This breakdown of the problem into two separate tasks makes analytic control designs manageable, the final performance of the system will be limited compared to holistic approaches. For systems with relatively slow dynamics, nonlinear model predictive control can be used [3], which is, not suited for fast systems due to the continuous online optimization with non-deterministic computing time. We design a parametric controller for the position and orientation control of a mobile inverted pendulum (MIP), without the need to compute trajectories online

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call