2020
DOI: 10.48550/arxiv.2011.13101
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Regret Bounds for Adaptive Nonlinear Control

Abstract: We study the problem of adaptively controlling a known discrete-time nonlinear system subject to unmodeled disturbances. We prove the first finite-time regret bounds for adaptive nonlinear control with matched uncertainty in the stochastic setting, showing that the regret suffered by certainty equivalence adaptive control, compared to an oracle controller with perfect knowledge of the unmodeled disturbances, is upper bounded by O( √ T ) in expectation. Furthermore, we show that when the input is subject to a k… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 22 publications
0
4
0
Order By: Relevance
“…In this context, we could utilize Control Contraction Metrics (CCMs) [33,[77][78][79][80][81] for extending contraction theory to the systematic design of differential feedback control δu = k(x, δx, u, t) via convex optimization, achieving greater generality at the expense of computational efficiency in obtaining u. Applications of the CCM to estimation, adaptive control, and motion planning are discussed in [82], [83][84][85], and [78,[86][87][88][89], respectively, using geodesic distances between trajectories [49]. It is also worth noting that the objective function of CV-STEM has the condition number of a positive definite matrix that defines a contraction metric as one of its arguments, rendering it applicable and effective even to machine learning-based automatic control frameworks as shall be seen in Sec.…”
Section: Construction Of Contraction Metrics (Sec 3-4)mentioning
confidence: 99%
See 2 more Smart Citations
“…In this context, we could utilize Control Contraction Metrics (CCMs) [33,[77][78][79][80][81] for extending contraction theory to the systematic design of differential feedback control δu = k(x, δx, u, t) via convex optimization, achieving greater generality at the expense of computational efficiency in obtaining u. Applications of the CCM to estimation, adaptive control, and motion planning are discussed in [82], [83][84][85], and [78,[86][87][88][89], respectively, using geodesic distances between trajectories [49]. It is also worth noting that the objective function of CV-STEM has the condition number of a positive definite matrix that defines a contraction metric as one of its arguments, rendering it applicable and effective even to machine learning-based automatic control frameworks as shall be seen in Sec.…”
Section: Construction Of Contraction Metrics (Sec 3-4)mentioning
confidence: 99%
“…It is demonstrated in [101] that the aNCM is applicable to many types of systems such as robotics systems [5, p. 392], spacecraft high-fidelity dynamics [123,124], and systems modeled by basis function approximation and DNNs [125,126]. Discrete changes could be incorporated in this framework using [85,127].…”
Section: Construction Of Contraction Metrics (Sec 3-4)mentioning
confidence: 99%
See 1 more Smart Citation
“…In recent years, there has been a drive to connect adaptive control methods with techniques from reinforcement learning [7]- [10]. In parallel, methods from reinforcement learning have seen an explosion of work on linear giving precise optimality guaranatees [11]- [13].…”
Section: Introductionmentioning
confidence: 99%