2022
DOI: 10.48550/arxiv.2205.07554
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Towards on-sky adaptive optics control using reinforcement learning

J. Nousiainen,
C. Rajani,
M. Kasper
et al.

Abstract: Context. The direct imaging of potentially habitable Exoplanets is one prime science case for the next generation of high contrast imaging instruments on ground-based extremely large telescopes. To reach this demanding science goal, the instruments are equipped with eXtreme Adaptive Optics (XAO) systems which will control thousands of actuators at a framerate of kilohertz to several kilohertz. Most of the habitable exoplanets are located at small angular separations from their host stars, where the current con… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
9
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(9 citation statements)
references
References 50 publications
0
9
0
Order By: Relevance
“…24 MagAO-X can also be used for remote research and development, as demonstrated by the testing of a reinforcement learning algorithm. 25 MagAO-X is also part of the High-Contrast Adaptive Optics Testbed (HCAT) for the Giant Magellan Telescope (GMT), part of the overall program to develop segment phase sensing and control strategies for the GMT. 7 As part of HCAT MagAO-X serves as the AO system and houses phase sensing and control experiments.…”
Section: Lab Testbedmentioning
confidence: 99%
“…24 MagAO-X can also be used for remote research and development, as demonstrated by the testing of a reinforcement learning algorithm. 25 MagAO-X is also part of the High-Contrast Adaptive Optics Testbed (HCAT) for the Giant Magellan Telescope (GMT), part of the overall program to develop segment phase sensing and control strategies for the GMT. 7 As part of HCAT MagAO-X serves as the AO system and houses phase sensing and control experiments.…”
Section: Lab Testbedmentioning
confidence: 99%
“…8 On the other hand, a few predictive models based on reinforcement learning (RL) have been developed. [9][10][11][12][13][14][15] In this work, we focus on closed-loop RL methods with a wavefront sensor (WFS), in which the goal is to learn the control policy, a function that maps information of the system to commands, during operation. [11][12][13][14][15] In this case, there is no distribution mismatch between training and operational data as they are the same.…”
Section: Introductionmentioning
confidence: 99%
“…[9][10][11][12][13][14][15] In this work, we focus on closed-loop RL methods with a wavefront sensor (WFS), in which the goal is to learn the control policy, a function that maps information of the system to commands, during operation. [11][12][13][14][15] In this case, there is no distribution mismatch between training and operational data as they are the same. Moreover, in contrast to most current model-based predictive controllers, there are no assumptions, and the control dynamics are learnt purely from data.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…[20][21][22] A promising area of research for mitigating these nonlinearities is the use of neural networks for learning a nonlinear mapping between wavefront sensor measurements and wavefront, [23][24][25][26] or for nonlinear control. [27][28][29][30] Furthermore, the similarities between optical systems and Neural Networks have lead to studies exploiting automatic differentiation algorithms, initially developed for training NNs, for optimizing elements in the optical system 31,32 or more efficient wavefront control. 33,34 Automatic differentiation allows us to obtain gradients with respect to the free design parameters, even for complex optical systems with multiple elements and planes.…”
Section: Introductionmentioning
confidence: 99%