2022
DOI: 10.48550/arxiv.2202.10434
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Empirical Optimal Transport between Different Measures Adapts to Lower Complexity

Abstract: The empirical optimal transport (OT) cost between two probability measures from random data is a fundamental quantity in transport based data analysis. In this work, we derive novel guarantees for its convergence rate when the involved measures are different, possibly supported on different spaces. Our central observation is that the statistical performance of the empirical OT cost is determined by the less complex measure, a phenomenon we refer to as lower complexity adaptation of empirical OT. For instance, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
22
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 7 publications
(23 citation statements)
references
References 45 publications
1
22
0
Order By: Relevance
“…The rate can be refined by replacing the dimension d of the ambient space (e.g. X = R d ) by the intrinsic dimension (such as the Hausdorff dimension) of the input measures [177], and by the lowest intrinsic dimension of the two considered inputs [84]. Considering densities smooth to the order p reduces the complexity to O(N −p/d ) [178], but the estimators (α N , β N ) rely on wavelet decomposition or kernel estimation, which complicates the computation of OT(α N , β N ).…”
Section: Sample Complexitymentioning
confidence: 99%
“…The rate can be refined by replacing the dimension d of the ambient space (e.g. X = R d ) by the intrinsic dimension (such as the Hausdorff dimension) of the input measures [177], and by the lowest intrinsic dimension of the two considered inputs [84]. Considering densities smooth to the order p reduces the complexity to O(N −p/d ) [178], but the estimators (α N , β N ) rely on wavelet decomposition or kernel estimation, which complicates the computation of OT(α N , β N ).…”
Section: Sample Complexitymentioning
confidence: 99%
“…This can be used to estimate OT and its plan. In general, this yields a statistically efficient estimator [Dudley, 1969, Ajtai et al, 1984, Sommerfeld et al, 2019, Weed and Bach, 2019, Hütter and Rigollet, 2021, Manole et al, 2021, Hundrieser et al, 2022 see, however, Niles- Weed and Berthet [2022] for certain improvements. For OT barycenters based on empirical measures significantly less is known, though recently some progress has been made in the context of finitely supported measures [Heinemann et al, 2022b].…”
Section: Statistical Models and Contributionsmentioning
confidence: 99%
“…2 In the naive upper bound based on the triangle inequality the potentially slower rate of the two measures dominates the speed of convergence. However, there are scenarios where the opposite holds true [Hundrieser et al, 2022]. This phenomenon is known as lower complexity adaptation.…”
Section: Barycentersmentioning
confidence: 99%
“…Another interesting question is whether considering the dual formulation of the (p, C)barycenter problem directly and then using tools from empirical process theory would allow to recover a phenomenon similar to the lower complexity adaptation [Hundrieser et al, 2022]. The constants in empirical deviation bounds in this chapter depend on the average of the constants arising from J individual estimation problem.…”
Section: Statistical Modelling For Alternative Uot Modelsmentioning
confidence: 99%