2021
DOI: 10.48550/arxiv.2102.01514
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Metrics and continuity in reinforcement learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 0 publications
0
3
0
Order By: Relevance
“…The idea to estimate the explanation element (E1) is to find the feature-sets of similar states that frequently appear conditioned on a particular action a determined by the policy π. That is, for a given state s for which users are asking for an explanation, we compute the appearance frequency of features in similar states to s. We chose to use a value-based metric since it's simple and can be approximated to reduce its computational cost [26]. Our similarity metric groups states that have a similar value v to the reference v ref erence within the range 1.0 ± 0.05 × v ref erence .…”
Section: A Generating Explanationsmentioning
confidence: 99%
“…The idea to estimate the explanation element (E1) is to find the feature-sets of similar states that frequently appear conditioned on a particular action a determined by the policy π. That is, for a given state s for which users are asking for an explanation, we compute the appearance frequency of features in similar states to s. We chose to use a value-based metric since it's simple and can be approximated to reduce its computational cost [26]. Our similarity metric groups states that have a similar value v to the reference v ref erence within the range 1.0 ± 0.05 × v ref erence .…”
Section: A Generating Explanationsmentioning
confidence: 99%
“…Trivially any problem can be embedded in a metric space where the metric is taken to be the difference of values of the optimal Q function. Recent work has investigated the options of selecting a metric in terms of its induced topological structure on the space [28].…”
Section: Metric Space and Lipschitz Assumptionsmentioning
confidence: 99%
“…This assumes access to the similarity metrics. Learning the metric (or picking the metric) is important in practice, but beyond the scope of this paper [62,28]. Assumption 3.…”
Section: Metric Space and Lipschitz Assumptionsmentioning
confidence: 99%