Abstract

We present a novel formulation of closed-loop adaptive optics (AO) control as a multi-agent reinforcement learning (MARL) problem in which the controller learns a non-linear policy and needs no a priori information on the dynamics of the atmosphere. We identify the challenges of applying a reinforcement learning (RL) method to AO and, to address them, propose combining model-free MARL for control with an autoencoder neural network to mitigate the effect of noise. Moreover, we extend existing methods of error budget analysis to include an RL controller. Experimental results for an 8 m telescope equipped with a 40×40 Shack-Hartmann system show a significant increase in performance over the integrator baseline and performance comparable to a model-based predictive approach: a linear quadratic Gaussian (LQG) controller with perfect knowledge of the atmospheric conditions. Finally, the error budget analysis provides evidence that the RL controller partially compensates for bandwidth error and helps mitigate the propagation of aliasing.
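
As a rough illustration of the noise-mitigation stage named above, the sketch below shows a denoising autoencoder applied to wavefront-sensor (WFS) measurements before they reach the controller. This is a minimal sketch under assumed layer sizes and names (e.g. WFSDenoisingAutoencoder, n_measurements); it does not reproduce the paper's actual architecture.

    import torch
    import torch.nn as nn

    class WFSDenoisingAutoencoder(nn.Module):
        """Hypothetical denoiser for WFS slope vectors; sizes are illustrative."""

        def __init__(self, n_measurements: int, latent_dim: int = 256):
            super().__init__()
            # Encoder compresses the noisy slope vector to a latent code.
            self.encoder = nn.Sequential(
                nn.Linear(n_measurements, 1024), nn.ReLU(),
                nn.Linear(1024, latent_dim), nn.ReLU(),
            )
            # Decoder reconstructs a denoised estimate of the measurements.
            self.decoder = nn.Sequential(
                nn.Linear(latent_dim, 1024), nn.ReLU(),
                nn.Linear(1024, n_measurements),
            )

        def forward(self, noisy_slopes: torch.Tensor) -> torch.Tensor:
            return self.decoder(self.encoder(noisy_slopes))

    # A 40x40 Shack-Hartmann sensor yields x and y slopes per subaperture.
    model = WFSDenoisingAutoencoder(n_measurements=2 * 40 * 40)
    denoised = model(torch.randn(1, 2 * 40 * 40))  # one noisy WFS frame

Training such a model would pair noisy measurements with low-noise targets (e.g. from simulation) and minimise a mean-squared reconstruction error.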

Highlights

  • Closed-loop adaptive optics (AO) systems are a fundamental component of large telescopes, correcting in real time the dynamically evolving wavefront aberrations introduced by the atmosphere

  • We present a novel formulation of closed-loop adaptive optics (AO) control as a multi-agent reinforcement learning (MARL) problem in which the controller learns a non-linear policy and needs no a priori information on the dynamics of the atmosphere

  • We identify the challenges of designing an RL method for AO control and propose a solution: a model-free multi-agent reinforcement learning controller working as an additive corrector on top of an integrator controller, combined with an autoencoder to mitigate the impact of noise (a minimal sketch of this additive scheme follows this list)
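
A minimal sketch of that additive scheme, assuming each agent controls a disjoint block of deformable mirror (DM) actuators; the partitioning, names, and shapes here are illustrative assumptions, not the paper's exact design:

    import numpy as np

    def marl_additive_correction(c_int, agents, local_states):
        """Add per-agent RL corrections on top of the integrator command.

        c_int        : integrator command for all actuators (1-D array)
        agents       : trained policies, one per actuator block (hypothetical split)
        local_states : per-agent observations
        """
        corrections = [agent(state) for agent, state in zip(agents, local_states)]
        return c_int + np.concatenate(corrections)

    # Example with two dummy agents, each correcting two actuators.
    agents = [lambda s: np.zeros(2), lambda s: np.zeros(2)]
    c = marl_additive_correction(np.zeros(4), agents, [None, None])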


Introduction

Closed-loop adaptive optics (AO) systems are a fundamental component of large telescopes, correcting in real time the dynamically evolving wavefront aberrations introduced by the atmosphere. The classical AO controller relies on a linear relationship between measurements from the wavefront sensor (WFS) and deformable mirror (DM) commands, combined with an integrator whose gain factor mitigates errors introduced by, e.g., the intrinsic loop delay and temporal sampling. More advanced approaches have been developed in the form of model-based predictive controllers, which predict future wavefront distortions and issue the appropriate commands to correct them. This is the case of linear quadratic Gaussian (LQG) controllers, first introduced in [1], which combine a linear dynamics model describing the evolution of the system, a quadratic loss function to be optimised, and Kalman filtering to process noisy inputs. Integrating the reconstructed residual command, ΔC_t, with the previous command applied on the DM, C_{t−1}, weighted by the so-called gain g, mitigates the effects of the imperfect wavefront reconstruction and gives the final expression for the integrator command at timestep t, C_t, as seen in Eq. (2):

    C_t = C_{t−1} + g · ΔC_t        (2)
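
A minimal closed-loop sketch of this integrator update, assuming a calibrated command matrix maps WFS slopes to a residual DM command; the reconstructor and dimensions below are placeholders, and only the update rule of Eq. (2) comes from the text:

    import numpy as np

    def integrator_step(c_prev, slopes, command_matrix, gain):
        """One step of the classical integrator: C_t = C_{t-1} + g * dC_t."""
        delta_c = command_matrix @ slopes   # reconstructed residual command dC_t
        return c_prev + gain * delta_c      # integrate with gain g

    # Illustrative dimensions for a 40x40 Shack-Hartmann system (x/y slopes).
    n_slopes, n_act = 2 * 40 * 40, 41 * 41
    cmat = np.zeros((n_act, n_slopes))      # stands in for a calibrated reconstructor
    c = np.zeros(n_act)                     # previous DM command C_{t-1}
    c = integrator_step(c, np.zeros(n_slopes), cmat, gain=0.5)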
