Abstract Proceedings of the 2022 ACM SIGMETRICS/IFIP PERFORMANCE Joint International Conference on Measurement and Modeling of 2022
DOI: 10.1145/3489048.3522658
|View full text |Cite
|
Sign up to set email alerts
|

Robustness and Consistency in Linear Quadratic Control with Untrusted Predictions

Abstract: We study the problem of learning-augmented predictive linear quadratic control. Our goal is to design a controller that balances consistency, which measures the competitive ratio when predictions are accurate, and robustness, which bounds the competitive ratio when predictions are inaccurate. We propose a novel πœ†-confident controller and prove that it maintains a competitive ratio upper bound of 1 + min{𝑂 (πœ† 2 πœ€) + 𝑂 (1 βˆ’ πœ†) 2 , 𝑂 (1) + 𝑂 (πœ† 2 )} where πœ† ∈ [0, 1] is a trust parameter set based on the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(11 citation statements)
references
References 14 publications
0
11
0
Order By: Relevance
“…Before proceeding to our policy, to highlight the challenge of combining modelbased advice with model-free policies in this setting we first consider a simple strategy for combining the two via a convex combination. This is an approach that has been proposed and studied previously, e.g., [7], [21]. However, we show that it can be problematic in that it can yield an unstable policy even when the two policies are stabilizing individually.…”
Section: Warmup: a Naive Convex Combinationmentioning
confidence: 83%
See 3 more Smart Citations
“…Before proceeding to our policy, to highlight the challenge of combining modelbased advice with model-free policies in this setting we first consider a simple strategy for combining the two via a convex combination. This is an approach that has been proposed and studied previously, e.g., [7], [21]. However, we show that it can be problematic in that it can yield an unstable policy even when the two policies are stabilizing individually.…”
Section: Warmup: a Naive Convex Combinationmentioning
confidence: 83%
“…Our results imply an interesting trade-off between stability and optimality, in the sense that if Ξ» is smaller, it is guaranteed to stabilize with a higher rate and if Ξ» becomes larger, it is able to have a smaller competitive ratio bound when provided with a high-quality black-box policy. Different from the linear case, where a cost characterization lemma can be directly applied to bound the difference between the policy costs and optimal costs in terms of the difference between their actions [21], for the case of nonlinear dynamics (1), we introduce an auxiliary linear problem to derive an upper bound on the dynamic regret, whose value can be decomposed into a quadratic term and a term induced by the nonlinearity. The first term can be bounded via a generalized characterization lemma and becomes the model-based bound and model-free error in (2).…”
Section: This Work Single Trajectorymentioning
confidence: 99%
See 2 more Smart Citations
“…Bounds on the competitive ratio has been the focus of many prior works in control [28][29][30][31]. The competitive-ratio control problem has been proposed in [5], where its sub-optimal solution was derived under certain conditions.…”
Section: Introductionmentioning
confidence: 99%