2021
DOI: 10.1007/s10489-021-02787-4

WDIBS: Wasserstein deterministic information bottleneck for state abstraction to balance state-compression and performance

Cited by 1 publication (1 citation statement)
References 24 publications
“…Traditional RL uses interaction with the environment to continuously conduct trial and error to optimize the strategy [31,32]. The most famous theoretical framework of HRL is the options framework (as shown in Figure 1) [14,33]. The option is a three-element tuple (I, π, β), where π : S → [0, 1] represents the strategy, which is a probability distribution function based on the state space and the motion space; β : S → [0, 1] is the termination condition, β(s) indicates that the state s has the probability of β(s) to terminate and exit the current option; I ⊆ S indicates the initial state of the option.…”
Section: Entities and Relations Extraction (mentioning)
confidence: 99%
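
The option tuple (I, π, β) described in the citation statement above can be illustrated with a minimal Python sketch. Everything named here (`Option`, `run_option`, the five-state corridor, the `step` function) is hypothetical and chosen for illustration only; none of it comes from the cited papers. The sketch also assumes that π maps each state to a probability distribution over actions, which is one common reading of "a probability distribution function based on the state space and the motion space".

```python
import random
from dataclasses import dataclass
from typing import Callable, Dict, Hashable, List, Set

State = Hashable
Action = Hashable


@dataclass
class Option:
    """Hypothetical container for an option (I, pi, beta)."""
    initiation_set: Set[State]                      # I, a subset of S
    policy: Callable[[State], Dict[Action, float]]  # pi(s): action -> probability (assumed form)
    termination: Callable[[State], float]           # beta(s), a probability in [0, 1]

    def can_start(self, state: State) -> bool:
        # The option may only be invoked from a state in I.
        return state in self.initiation_set


def run_option(option: Option, state: State,
               step: Callable[[State, Action], State],
               rng: random.Random, max_steps: int = 1000) -> List[State]:
    """Follow the option's policy until beta(s) triggers termination."""
    if not option.can_start(state):
        raise ValueError("state is not in the option's initiation set")
    trajectory = [state]
    for _ in range(max_steps):
        dist = option.policy(state)
        actions = list(dist)
        # Sample an action according to pi(s).
        action = rng.choices(actions, weights=[dist[a] for a in actions])[0]
        state = step(state, action)
        trajectory.append(state)
        # Terminate with probability beta(s) in the new state.
        if rng.random() < option.termination(state):
            break
    return trajectory


# Toy environment (an assumption): a corridor with states 0..4.
def step(state: int, action: str) -> int:
    return min(state + 1, 4) if action == "right" else max(state - 1, 0)


go_right = Option(
    initiation_set={0, 1, 2, 3},
    policy=lambda s: {"right": 1.0},
    termination=lambda s: 1.0 if s == 4 else 0.0,
)

trajectory = run_option(go_right, 0, step, random.Random(0))
```

Because β is 0 everywhere except at state 4, where it is 1, this option always runs until it reaches the end of the corridor. A stochastic β would instead let the option exit early from intermediate states.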