Abstract
In this book, we show how reinforcement learning (RL) techniques can be used to unify optimal control and adaptive control. Specifically, we develop a novel class of adaptive control structures that learn the solutions of optimal control problems in real time from data measured online along the system trajectories. We call these optimal adaptive controllers. Their structures are based on the actor-critic learning architecture.