2021
DOI: 10.48550/arxiv.2105.14709
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Joint Stabilization and Regret Minimization through Switching in Over-Actuated Systems (extended version)

Abstract: Adaptively controlling and minimizing regret in unknown dynamical systems while controlling the growth of the system state is crucial in real-world applications. In this work, we study the problem of stabilizing and regret minimization of linear dynamical systems with system-level actuator redundancy. We propose an optimism-based algorithm that utilizes the actuator redundancy and the possibility of switching between actuating modes to guarantee the boundedness of the state. This is in contrast to the prior wo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 15 publications
0
4
0
Order By: Relevance
“…As such, the algorithms update the estimate and policy when the condition det(V j t ) > 2 det(V j τ ) is fulfilled where τ is the last time the policy got updated (τ is solely used in Algorithm 2 and should not be mixed with switch times in actuating modes; τ i s). This is a standard step in OFU-based algorithms ( [29], [32], and [33]).…”
Section: Control Designmentioning
confidence: 99%
See 2 more Smart Citations
“…As such, the algorithms update the estimate and policy when the condition det(V j t ) > 2 det(V j τ ) is fulfilled where τ is the last time the policy got updated (τ is solely used in Algorithm 2 and should not be mixed with switch times in actuating modes; τ i s). This is a standard step in OFU-based algorithms ( [29], [32], and [33]).…”
Section: Control Designmentioning
confidence: 99%
“…and V i [0 t] and Θi t is given by (32,33). On the other when the projection is not applied the confidence set is given as follows:…”
Section: Stability Analysismentioning
confidence: 99%
See 1 more Smart Citation
“…In this fashion, (Ibrahimi et al, 2012) introduced O(p √ T ) regret algorithm with state-space dimension of p. (Lale et al, 2022) applied the OFU principle and, through imposing extra exploratory noise, accelerated the stabilization of the system. (Chekan et al, 2021) introduced an OFU-based strategy for joint stabilization and regret minimization of over-actuated systems. (Cohen et al, 2019) formulated the OFU-based approach in a relaxed semi-definite program (SDP) and, as such, proposed a computationally efficient counterpart of the algorithm from (Abbasi-Yadkori & Szepesvári, 2011), which involves solving a non-convex optimization program.…”
Section: Introductionmentioning
confidence: 99%