2020 American Control Conference (ACC) 2020
DOI: 10.23919/acc45564.2020.9147920
|View full text |Cite
|
Sign up to set email alerts
|

Anticipating the long-term effect of online learning in control

Abstract: Control tasks with high levels of uncertainty and safety requirements are increasingly common. Typically, techniques that guarantee safety during learning and control utilize constraint-based safety certificates, which can be leveraged to compute safe control inputs. However, if model uncertainty is very high, the corresponding certificates are potentially invalid, meaning no control input satisfies the constraints imposed by the safety certificate. This paper considers a learning-based setting with a safety c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(7 citation statements)
references
References 36 publications
0
7
0
Order By: Relevance
“…where the expectation in (19) and the probability in (20) refer to the realization of the uncertainties/disturbance vector profile w. The condensed expression g(u, x k , w) ≤ 0 refers to the vector of constraints to be satisfied. This might gather stage constraints over the prediction horizon as well as terminal constraints at the end of the prediction horizon.…”
Section: Stochastic Model Predictive Controlmentioning
confidence: 99%
See 2 more Smart Citations
“…where the expectation in (19) and the probability in (20) refer to the realization of the uncertainties/disturbance vector profile w. The condensed expression g(u, x k , w) ≤ 0 refers to the vector of constraints to be satisfied. This might gather stage constraints over the prediction horizon as well as terminal constraints at the end of the prediction horizon.…”
Section: Stochastic Model Predictive Controlmentioning
confidence: 99%
“…This is clearly shown on Figure 1 through the notation u 0 , u 1 , u 2 and u 3 . Note that the expression ( 26) is supposed to be an approximation of the expected cost as defined by (19). Similar weighted constraints can be similarly defined in order to replace (20).…”
Section: Uncertainties Clustering-based Solutionsmentioning
confidence: 99%
See 1 more Smart Citation
“…Therefore, we can iteratively sample F x k from the conditional probabilities P Fx k |Fx k−1 ,...,Fx 0 , such that the samples satisfy (7). Furthermore, we can pursue the same procedure to obtain samples for the observation model {H(x, ψ)} x∈X due to the equivalence of both sampling problems.…”
Section: B Trajectory Sampling From Conditional Distributionsmentioning
confidence: 99%
“…For example, the consideration of the uncertainty of learned models leads to cautiousness in model predictive control [4], and can be efficiently implemented using scenario approaches [5]. Furthermore, uncertainty awareness in model-based reinforcement learning automatically balances system identification and task execution [6], and allows to anticipate the effect of online learning during policy optimization [7].…”
Section: Introductionmentioning
confidence: 99%