Adjustable and Adaptive Control for an Unstable Mobile Robot Using Imitation Learning with Trajectory Optimization

Christian Dengler,Boris Lohmann

doi:10.3390/robotics9020029

Abstract

In this contribution, we develop a feedback controller in the form of a parametric function for a mobile inverted pendulum. The control both stabilizes the system and drives it to target positions with target orientations. A design of the controller based only on a cost function is difficult for this task, which is why we choose to train the controller using imitation learning on optimized trajectories. In contrast to popular approaches like policy gradient methods, this approach allows us to shape the behavior of the system by including equality constraints. When transferring the parametric controller from simulation to the real mobile inverted pendulum, the control performance is degraded due to the reality gap. A robust control design can reduce the degradation. However, for the framework of imitation learning on optimized trajectories, methods that explicitly consider robustness do not yet exist to the knowledge of the authors. We tackle this research gap by presenting a method to design a robust controller in the form of a recurrent neural network, to improve the transferability of the trained controller to the real system. As a last step, we make the behavior of the parametric controller adjustable to allow for the fine tuning of the behavior of the real system. We design the controller for our system and show in the application that the recurrent neural network has increased performance compared to a static neural network without robustness considerations.

Highlights

The control of mobile and unstable systems is, in most cases, divided into a stabilizing and a maneuvering part, e.g., [1]
Methods based on optimization and learning come into play that train a control law in the form of a parametric function based on a cost function
The cost function c( xt, ut ) is the same that was used for the trajectory optimization

Summary

Introduction

The control of mobile and unstable systems is, in most cases, divided into a stabilizing and a maneuvering part, e.g., [1]. This breakdown of the problem into two separate tasks makes analytic control designs manageable, the final performance of the system will be limited compared to holistic approaches. For systems with relatively slow dynamics, nonlinear model predictive control can be used [3], which is, not suited for fast systems due to the continuous online optimization with non-deterministic computing time. We design a parametric controller for the position and orientation control of a mobile inverted pendulum (MIP), without the need to compute trajectories online

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Robotics	Publication Date: Apr 25, 2020
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Adjustable and Adaptive Control for an Unstable Mobile Robot Using Imitation Learning with Trajectory Optimization

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Robotics

Lead the way for us

Similar Papers

Adaptive Control Based On Neural Network
Sun Wei ... Zhang Lujin
-
Sun Wei, et. al.Sun Wei ... Zhang Lujin
01 Jan 2009
01 Jan 2009

Neural network position and orientation control of an inverted pendulum on wheels
Christian Dengler ... Lohmann Boris
-
Christian Dengler, et. al.Christian Dengler ... Lohmann Boris
01 Dec 2019
01 Dec 2019

System identification for robust and inferential control : with applications to ILC and precision motion systems

-

18 Nov 2015
18 Nov 2015

The Sixth International Symposium on Neural Networks (ISNN 2009)
-
-
--
01 Jan 2009
The Sixth International Symposium on Neural Networks (ISNN 2009)
-

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Adjustable and Adaptive Control for an Unstable Mobile Robot Using Imitation Learning with Trajectory Optimization

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Robotics