Proceedings of the 24th International Conference on Hybrid Systems: Computation and Control 2021
DOI: 10.1145/3447928.3456707
|View full text |Cite
|
Sign up to set email alerts
|

A few lessons learned in reinforcement learning for quadcopter attitude control

Abstract: In the context of developing safe air transportation, our work is focused on understanding how Reinforcement Learning methods can improve the state of the art in traditional control, in nominal as well as non-nominal cases. The end goal is to train provably safe controllers, by improving both training and verification methods. In this paper, we explore this path for controlling the attitude of a quadcopter: we discuss theoretical as well as practical aspects of training neural nets for controlling a crazyflie … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(2 citation statements)
references
References 27 publications
0
2
0
Order By: Relevance
“…Such semantics have been defined to assess the robustness of systems to parameter or timing variations [6]. It has also been used recently [4] in the context of controller synthesis with reinforcement learning to provide a reward associated to the formalized STL properties. STL as part of the Model Predictive Control (MPC) design has also been studied in [18], in which the combinatorial nature of such specifications is preserved, and thus Mixed-Integer Linear Programming (MILP) has to be employed.…”
Section: State Of the Artmentioning
confidence: 99%
See 1 more Smart Citation
“…Such semantics have been defined to assess the robustness of systems to parameter or timing variations [6]. It has also been used recently [4] in the context of controller synthesis with reinforcement learning to provide a reward associated to the formalized STL properties. STL as part of the Model Predictive Control (MPC) design has also been studied in [18], in which the combinatorial nature of such specifications is preserved, and thus Mixed-Integer Linear Programming (MILP) has to be employed.…”
Section: State Of the Artmentioning
confidence: 99%
“…Note that in (5), (6), and (7), we still have non-differentiable min-or maxfunctions. One can certainly use the log-sum-exponential function in (4) to approximate these min-/max-functions, but this approach is usually not accurate around non-differentiable points, especially with only two variables, which is exactly the case we have on hand.…”
Section: Polynomial Smooth Min Smin()mentioning
confidence: 99%