2022
DOI: 10.48550/arxiv.2204.05263
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Maximum entropy optimal density control of discrete-time linear systems and Schrödinger bridges

Abstract: We consider an entropy-regularized version of optimal density control of deterministic discrete-time linear systems. Entropy regularization, or a maximum entropy (MaxEnt) method for optimal control has attracted much attention especially in reinforcement learning due to its many advantages such as a natural exploration strategy. Despite the merits, high-entropy control policies introduce probabilistic uncertainty into systems, which severely limits the applicability of MaxEnt optimal control to safety-critical… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
8
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(8 citation statements)
references
References 18 publications
0
8
0
Order By: Relevance
“…Proof. The equalities µ k = µ k and v k = v k follow immediately given that the problem in (33) and the problem in (21) share the constraint in (12a) that enforces the state mean dynamics and in addition, the terms related to the mean dynamics in the objective functions J NLP and J SDP are equal. By the definition of M k in (24), it follows that tr…”
Section: Sdp Formulation For Randomized State Feedback Policymentioning
confidence: 98%
See 4 more Smart Citations
“…Proof. The equalities µ k = µ k and v k = v k follow immediately given that the problem in (33) and the problem in (21) share the constraint in (12a) that enforces the state mean dynamics and in addition, the terms related to the mean dynamics in the objective functions J NLP and J SDP are equal. By the definition of M k in (24), it follows that tr…”
Section: Sdp Formulation For Randomized State Feedback Policymentioning
confidence: 98%
“…, N − 1}. In addition, the objective function J SDP is defined as follows: (21), and let {µ k , v k , Σ k , P k , M k , L} be the minimizer of the SDP in (33). Then,…”
Section: Sdp Formulation For Randomized State Feedback Policymentioning
confidence: 99%
See 3 more Smart Citations