2022
DOI: 10.1093/imaiai/iaac026
|View full text |Cite
|
Sign up to set email alerts
|

On the robustness to adversarial corruption and to heavy-tailed data of the Stahel–Donoho median of means

Abstract: We consider median of means (MOM) versions of the Stahel–Donoho outlyingness (SDO) [ 23, 66] and of the Median Absolute Deviation (MAD) [ 30] functions to construct subgaussian estimators of a mean vector under adversarial contamination and heavy-tailed data. We develop a single analysis of the MOM version of the SDO which covers all cases ranging from the Gaussian case to the $L_2$ case. It is based on isomorphic and almost isometric properties of the MOM versions of SDO and MAD. This analysis also covers cas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 42 publications
0
5
0
Order By: Relevance
“…That is, one aims to construct an estimator µ such that ( µ−µ) T Σ −1 ( µ−µ) is small with high probability. It appears that the bounds with respect to this norm are necessarily dimension-dependent, and a simple VC-type/sphere covering argument is sufficient to obtain the optimal rates of convergence [DL22]. Similar observations are also valid for the covariance estimation problem.…”
Section: Robust Multivariate Mean Estimationmentioning
confidence: 58%
See 3 more Smart Citations
“…That is, one aims to construct an estimator µ such that ( µ−µ) T Σ −1 ( µ−µ) is small with high probability. It appears that the bounds with respect to this norm are necessarily dimension-dependent, and a simple VC-type/sphere covering argument is sufficient to obtain the optimal rates of convergence [DL22]. Similar observations are also valid for the covariance estimation problem.…”
Section: Robust Multivariate Mean Estimationmentioning
confidence: 58%
“…Another interesting aspect of our analysis is that we do not make any assumptions on the sample size. In comparison, the existing estimators that recover the optimal bound in the isotropic case [CGR18,DK19] require N c(d + log(1/δ)), or N cdε −2 as in [DL22], where c > 0 is some absolute constant. We will encounter some weaker assumptions in Section 6, but only when tuning a single real-valued parameter of our estimator.…”
Section: Robust Multivariate Mean Estimationmentioning
confidence: 99%
See 2 more Smart Citations
“…[9] studied the non-asymptotic concentration of the heteroskedastic Wishart-type matrices; Ref. [10] constructed sub-Gaussian estimators of a mean vector under adversarial contamination and heavy-tailed data by Median-of-Mean versions of the Stahel-Donoho outlyingness and of Median Absolute Deviation functions; Ref. [11] obtained the deconvolution for some singular density errors via a combinatorial Median-of-Mean approach and assessed the estimator quality by establishing non-asymptotic risk bounds.…”
Section: Introductionmentioning
confidence: 99%