2021
DOI: 10.48550/arxiv.2107.07455
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks

Abstract: There has been significant research done on developing methods for improving robustness to distributional shift and uncertainty estimation. In contrast, only limited work has examined developing standard datasets and benchmarks for assessing these approaches. Additionally, most work on uncertainty estimation and robustness has developed new techniques based on small-scale regression or image classification tasks. However, many tasks of practical interest have different modalities, such as tabular data, audio, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
21
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 16 publications
(21 citation statements)
references
References 38 publications
0
21
0
Order By: Relevance
“…The Vehicle Motion prediction part of Shifts Dataset [16] contains 5 seconds of past and 5 seconds of future states for all agents in a scene along with overall scene features. The goal of the challenge is to build a model that predicts k ≤ 5 future trajectories ỹk i in the horizon of T = 25 timesteps along with their confidences ω k and overall scene uncertainty U for each scene x i .…”
Section: Problem Statementmentioning
confidence: 99%
“…The Vehicle Motion prediction part of Shifts Dataset [16] contains 5 seconds of past and 5 seconds of future states for all agents in a scene along with overall scene features. The goal of the challenge is to build a model that predicts k ≤ 5 future trajectories ỹk i in the horizon of T = 25 timesteps along with their confidences ω k and overall scene uncertainty U for each scene x i .…”
Section: Problem Statementmentioning
confidence: 99%
“…We use F1@95 score to jointly evaluate uncertainty and robustness. A good uncertainty measure should achieve low R-AUC, high F1-AUC and high F1@95 scores [13]. These are presented in Table 2 .…”
Section: Ll-fisher Uncertainty (Ll-fu)mentioning
confidence: 99%
“…Most of the available datasets such as Imagenet -C [3], A [5], R [4], O [5] and WILDS [7] focus primarily on image classification tasks. The recently introduced Shifts Dataset [13] provides a favourable data setting. It is composed of three parts each corresponding to a different data modality: tabular weather prediction data, machine translation data and self-driving car data.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Nowadays, few researchers are surprised by NNs performing very well on some in-domain data distribution, and there has been increasing interest in developing models that are robust to domain shifts [8,9,10,11]. Here we focus on the recently proposed benchmark for evaluating domain robust systems, WILDS [11], and we share our empirical experience with two datasets of WILDS, iWildCam and FMoW, as well as their baseline models.…”
Section: Introductionmentioning
confidence: 99%