Decision Theory Models for Applications in Artificial Intelligence 2012
DOI: 10.4018/978-1-60960-165-2.ch007
|View full text |Cite
|
Sign up to set email alerts
|

Automatically Generated Explanations for Markov Decision Processes

Abstract: Explaining policies of Markov Decision Processes (MDPs) is complicated due to their probabilistic and sequential nature. We present a technique to explain policies for factored MDPs by populating a set of domain-independent templates. We also present a mechanism to determine a minimal set of templates that, viewed together, completely justify the policy. These explanations can be generated automatically at run-time with no additional effort required from the MDP designer. We demonstrate this technique using th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
50
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 33 publications
(50 citation statements)
references
References 16 publications
0
50
0
Order By: Relevance
“…An example of an explanation of this form, in which only one rewarding scenario need be considered is, taken verbatim from [Khan et al 2009]: Action TakeCS343 & CS448 is the best action because: It is likely to take you to CoursesCompleted = 6, TermNumber = Final about 0.86 times, which is as high as any other action.…”
Section: Explanation In Recommender Systemsmentioning
confidence: 99%
See 4 more Smart Citations
“…An example of an explanation of this form, in which only one rewarding scenario need be considered is, taken verbatim from [Khan et al 2009]: Action TakeCS343 & CS448 is the best action because: It is likely to take you to CoursesCompleted = 6, TermNumber = Final about 0.86 times, which is as high as any other action.…”
Section: Explanation In Recommender Systemsmentioning
confidence: 99%
“…While this particular explanation consists of only a single template, in general the full explanation for π * (s) consists of the minimum number of templates required to explain some portion of the utility, V MSE , such that Q(s, π * (s)) ≥ V MSE > Q(s, a), ∀a = π * (s) [Khan et al 2011]. We refer to an approach such as this, which focuses on explaining some suitable portion of the utility function, as a "most coverage" approach.…”
Section: Explanation In Recommender Systemsmentioning
confidence: 99%
See 3 more Smart Citations