Wasserstein barycenters and variance-like criteria based on the Wasserstein distance are used in many problems to analyze the homogeneity of collections of distributions and structural relationships between the observations. We propose the estimation of the quantiles of the empirical process of Wasserstein's variation using a bootstrap procedure. We then use these results for statistical inference on a distribution registration model for general deformation functions.The tests are based on the variance of the distributions with respect to their Wasserstein's barycenters for which we prove central limit theorems, including bootstrap versions.In cases where G is a parametric class, estimation of the warping functions is studied in [2]. However, estimation/registration procedures may lead to inconsistent conclusions if the chosen deformation class G is too small. It is, therefore, important to be able to assess the fit to the deformation model given by a particular choice of G. This is the main goal of this paper. We note that within this framework, statistical inference on deformation models for distributions has been studied first in [20]. Here we provide a different approach which allows to deal with more general deformation classes.The pioneering works [15,25] study the existence of relationships between distributions F and G by using a discrepancy measure ∆(F, G) between them which is built using the Wasserstein distance. The authors consider the assumption H 0 : ∆(F, G) > ∆ 0 versus H a : ∆(F, G) ≤ ∆ 0 for a chosen threshold ∆ 0 . Thus when the null hypothesis is rejected, there is statistical evidence that the two distributions are similar with respect to the chosen criterion. In this same vein, we define a notion of variation of distributions using the Wasserstein distance, W r , in the set W r (R d ) of probability measures with finite rth moments, where r ≥ 1. This notion generalizes the concept of variance for random distributions over R d . This quantity can be defined aswhich measures the spread of the distributions. Then, to measure closeness to a deformation model, we take a look at the minimal variation among warped distributions, a quantity that we could consider as a minimal alignment cost. Under some mild conditions, a deformation model holds if and only if this minimal alignment cost is null and we can base our assessment of a deformation model on this quantity. As in [15,25], we provide results (a Central Limit Theorem and bootstrap versions) that enable to reject that the minimal alignment cost exceeds some threshold, and hence to conclude that it is below that threshold. Our results are given in a setup of general, nonparametric classes of warping functions. We also provide results in the somewhat more restrictive setup where one is interested in the more classical goodness-of-fit problem for the deformation model. Note that a general Central Limit Theorem is available for the Wasserstein distance in [18].The paper is organized as follows. The main facts about Wasserstein variation are present...
We consider a parametric deformation model for distributions. More precisely, we assume we observe J samples of random variables which are warped from an unknown distribution template. We tackle in this paper the problem of estimating the individual deformation parameters. For this, we construct a registering criterion based on the Wasserstein distance to quantify the alignment of the distributions. We prove consistency of the empirical estimators.
We provide in this paper a fully adaptive penalized procedure to select a covariance among a collection of models observing i.i.d replications of the process at fixed observation points. For this we generalize the results of [3] and propose to use a data driven penalty to obtain an oracle inequality for the estimator. We prove that this method is an extension to the matricial regression model of the work by Baraud in [1].
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.