2021
DOI: 10.48550/arxiv.2111.10476
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Towards Return Parity in Markov Decision Processes

Abstract: Algorithmic decisions made by machine learning models in high-stakes domains may have lasting impacts over time. Unfortunately, naive applications of standard fairness criterion in static settings over temporal domains may lead to delayed and adverse effects. To understand the dynamics of performance disparity, we study a fairness problem in Markov decision processes (MDPs). Specifically, we propose return parity, a fairness notion that requires MDPs from different demographic groups that share the same state … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(3 citation statements)
references
References 25 publications
0
3
0
Order By: Relevance
“…However, dynamic effects of the fair decisions on evolving populations is a far less studied area. Recently, a series of papers, among which [9,11,12,15,19,21,35,47,49], have started to investigate these dynamic effects, e.g., by modeling decisions and dynamics as Markov Decision Processes and by using reinforcement learning as well as dynamic programming to design fair policies and algorithms. One of the main messages of these works is that fair decisions in a static context, may not be fair in a dynamic scenario, where populations and disadvantage groups evolve in response to the decisions taken at previous time periods.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…However, dynamic effects of the fair decisions on evolving populations is a far less studied area. Recently, a series of papers, among which [9,11,12,15,19,21,35,47,49], have started to investigate these dynamic effects, e.g., by modeling decisions and dynamics as Markov Decision Processes and by using reinforcement learning as well as dynamic programming to design fair policies and algorithms. One of the main messages of these works is that fair decisions in a static context, may not be fair in a dynamic scenario, where populations and disadvantage groups evolve in response to the decisions taken at previous time periods.…”
Section: Related Workmentioning
confidence: 99%
“…Remark 5.1. This is quite interesting for funding allocation, but also in vaccine allocations over multiple periods, when one wants to make sure that the vaccination uptake is equivalent among neighboring countries (to allow for safe travel), and the global average uptake is as high as possible 9 . Figure 5 seems to indicate that the proportional-to-population strategy may not be as effective to achieve equitability as more targeted policies based on actual uptakes, and the countries' capacity to turn vaccine vials into vaccinated people.…”
Section: Changing Costmentioning
confidence: 99%
See 1 more Smart Citation