2021
DOI: 10.48550/arxiv.2106.01048
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Expected Scalarised Returns Dominance: A New Solution Concept for Multi-Objective Decision Making

Conor F. Hayes,
Timothy Verstraeten,
Diederik M. Roijers
et al.

Abstract: In many real-world scenarios, the utility of a user is derived from the single execution of a policy. In this case, to apply multi-objective reinforcement learning, the expected utility of the returns must be optimised. Various scenarios exist where a user's preferences over objectives (also known as the utility function) are unknown or difficult to specify. In such scenarios, a set of optimal policies must be learned. However, settings where the expected utility must be maximised have been largely overlooked … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 27 publications
(62 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?