2008
DOI: 10.1007/s11222-008-9096-5
|View full text |Cite
|
Sign up to set email alerts
|

Controlling the size of multivariate outlier tests with the MCD estimator of scatter

Abstract: Multivariate outlier detection requires computation of robust distances to be compared with appropriate cut-off points. In this paper we propose a new calibration method for obtaining reliable cut-off points of distances derived from the MCD estimator of scatter. These cut-off points are based on a more accurate estimate of the extreme tail of the distribution of robust distances. We show that our procedure gives reliable tests of outlyingness in almost all situations of practical interest, provided that the s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
19
0

Year Published

2010
2010
2018
2018

Publication Types

Select...
4
3
1

Relationship

3
5

Authors

Journals

citations
Cited by 32 publications
(20 citation statements)
references
References 22 publications
1
19
0
Order By: Relevance
“…Therefore, we follow the approach of Cerioli et al . (), and we resort to a different Monte Carlo approximation of ςγ2 based on pooling all the n * = n × K estimates ŝ[i]2(k) in a single sample Ŝ=ŝ[1]2(1),,ŝ[1]2(K),ŝ[2]2(1),,ŝ[2]2(K),,ŝ[n]2(1),,ŝ[n]2(K). Motivated by convergence results (here as n * → ∞ ) for quantiles, even in the case of non‐independent observations (see, e.g. Korosok, ), we take the γ ‐th quantile of the pooled sample Ŝ as our estimate of ςγ2, that is, trueς̂γ2=Ŝ[l], where Ŝ[l] is the l ‐th ordered value in Ŝ and l is defined as in , with n replaced by n * .…”
Section: Reliable Robust Diagnosticsmentioning
confidence: 99%
See 2 more Smart Citations
“…Therefore, we follow the approach of Cerioli et al . (), and we resort to a different Monte Carlo approximation of ςγ2 based on pooling all the n * = n × K estimates ŝ[i]2(k) in a single sample Ŝ=ŝ[1]2(1),,ŝ[1]2(K),ŝ[2]2(1),,ŝ[2]2(K),,ŝ[n]2(1),,ŝ[n]2(K). Motivated by convergence results (here as n * → ∞ ) for quantiles, even in the case of non‐independent observations (see, e.g. Korosok, ), we take the γ ‐th quantile of the pooled sample Ŝ as our estimate of ςγ2, that is, trueς̂γ2=Ŝ[l], where Ŝ[l] is the l ‐th ordered value in Ŝ and l is defined as in , with n replaced by n * .…”
Section: Reliable Robust Diagnosticsmentioning
confidence: 99%
“…This tendency was first noted by Cook & Hawkins (), although in a somewhat biased context, and then confirmed in subsequent studies, even when ad hoc finite‐sample corrections are taken into account; see, for example, Cerioli et al . (), Fauconnier & Haesbroeck (), Cerioli et al . (), Lourenco & Pires () and Riani et al .…”
Section: Introductionmentioning
confidence: 95%
See 1 more Smart Citation
“…In fact, this was the view behind the seminal work of Wilks (1963). More recent appreciation of this requirement can be found in the techniques developed by Hadi (1994), Becker and Gather (1999), García-Escudero and Gordaliza (2005), Hardin and Rocke (2005) and Cerioli, Riani, and Atkinson (2009).…”
Section: One Multivariate Samplementioning
confidence: 99%
“…We also provided some corrections in the case of multivariate data, where Mahalanobis distances replace scaled residuals. Our improved methods involve both distributional results for finite samples and powerful ways of dealing with multiple outliers tests (Cerioli et al , ; Cerioli, ; Cerioli and Farcomeni, ). However, we acknowledge that a unified asymptotic theory of multivariate outlier detection has yet to come.…”
mentioning
confidence: 99%