2019
DOI: 10.1007/s10208-019-09426-y
|View full text |Cite
|
Sign up to set email alerts
|

On the Sample Complexity of the Linear Quadratic Regulator

Abstract: This paper addresses the optimal control problem known as the Linear Quadratic Regulator in the case when the dynamics are unknown. We propose a multi-stage procedure, called Coarse-ID control, that estimates a model from a few experimental trials, estimates the error in that model with respect to the truth, and then designs a controller using both the model and uncertainty estimate. Our technique uses contemporary tools from random matrix theory to bound the error in the estimation procedure. We also employ a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

14
449
1
1

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 341 publications
(465 citation statements)
references
References 51 publications
14
449
1
1
Order By: Relevance
“…Proof. We show that any controller in (29) is an internally stabilizing controller in (13). The other direction is similar.…”
Section: A Simplified Parameterizations For State Feedbackmentioning
confidence: 64%
See 1 more Smart Citation
“…Proof. We show that any controller in (29) is an internally stabilizing controller in (13). The other direction is similar.…”
Section: A Simplified Parameterizations For State Feedbackmentioning
confidence: 64%
“…Theorem 1 presents explicit affine mappings between Youla parameterization and IOP: any element in the Youla parameterization (5) corresponds to an element in the IOP (13), and they represent the same controller. The following result presents explicit affine mappings between SLP and IOP.…”
Section: A Explicit Equivalencementioning
confidence: 99%
“…Additionally, the data-driven LQR problem is popular in the machine learning community, where it is typically assumed that the system is influenced by (Gaussian) process noise, see e.g. [51].…”
Section: Future Workmentioning
confidence: 99%
“…This is a classical problem, so there is a very large literature on robust stochastic control and its application to learning-theoretic methods; see e.g. [11,4,17,18,6,5,22,14,2,23]. A rather comprehensive literature review is presented in [13], and we refer the reader to the literature review therein.…”
Section: Literature Review and Contributionsmentioning
confidence: 99%