2019 IEEE 58th Conference on Decision and Control (CDC) 2019
DOI: 10.1109/cdc40024.2019.9029621
|View full text |Cite
|
Sign up to set email alerts
|

A Tutorial on Concentration Bounds for System Identification

Abstract: We provide a brief tutorial on the use of concentration inequalities as they apply to system identification of state-space parameters of linear time invariant systems, with a focus on the fully observed setting. We draw upon tools from the theories of large-deviations and self-normalized martingales, and provide both data-dependent and independent bounds on the learning rate.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
23
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 33 publications
(23 citation statements)
references
References 14 publications
0
23
0
Order By: Relevance
“…This tutorial paper and our companion paper [7] presented a broad overview of recent progress towards the finitetime analysis for reinforcement learning and adaptive control methods. We have attempted to provide a summary of representative results in this space that establish connections between the adaptive control literature and methods recently proposed in reinforcement learning.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…This tutorial paper and our companion paper [7] presented a broad overview of recent progress towards the finitetime analysis for reinforcement learning and adaptive control methods. We have attempted to provide a summary of representative results in this space that establish connections between the adaptive control literature and methods recently proposed in reinforcement learning.…”
Section: Discussionmentioning
confidence: 99%
“…We combine the techniques described in [7] with robust and optimal control to derive finite-time guarantees for the optimal LQR control of an unknown system. We partition our study according to three initial uncertainty regimes: (i) completely unknown (A, B), (ii) moderate error bounds under which CE control may fail, and (iii) small error bounds under which CE control is stabilizing.…”
Section: Model-based Methods For Lqrmentioning
confidence: 99%
See 2 more Smart Citations
“…The connection of RL with optimal control is known since ever (see, e.g., [15,38,63]). According to [48,49] tools stemming from advanced control theory should enhance RL in general.…”
Section: Introductionmentioning
confidence: 99%