Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD '02 2002
DOI: 10.1145/775048.775050
|View full text |Cite
|
Sign up to set email alerts
|

Scalable robust covariance and correlation estimates for data mining

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
27
0

Year Published

2012
2012
2019
2019

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 24 publications
(27 citation statements)
references
References 0 publications
0
27
0
Order By: Relevance
“…In this paradigm a few cells in a row (case) can be anomalous whereas many other cells in the same row still contain useful information, and in such situations we would rather not remove or downweight the entire row. The cellwise framework was first proposed and studied by Alqallaf et al (2002Alqallaf et al ( , 2009).…”
Section: Fast Detection Of Anomalous Cellsmentioning
confidence: 99%
“…In this paradigm a few cells in a row (case) can be anomalous whereas many other cells in the same row still contain useful information, and in such situations we would rather not remove or downweight the entire row. The cellwise framework was first proposed and studied by Alqallaf et al (2002Alqallaf et al ( , 2009).…”
Section: Fast Detection Of Anomalous Cellsmentioning
confidence: 99%
“…Alternatively, pairwise scatter estimators could be used as fast initial estimator (e.g., Alqallaf et al, 2002). Previous simulation studies have shown that pairwise scatter estimators are robust against cellwise outliers, but they perform not as well in the presence of casewise outliers and finely shaped multivariate data (Danilov et al, 2012;Agostinelli et al, 2015b).…”
Section: Computation Of the Initial Estimatormentioning
confidence: 99%
“…Following the notation of Alqallaf et al (2002Alqallaf et al ( , 2009, we write the cellwise contamination model in the following form:…”
Section: Background and Problem Setupmentioning
confidence: 99%
“…In addition, the MVE, MCD, and SD estimators all require heavy computational effort, rendering them impractical for high-dimensional datasets. To deal with cellwise contamination, Van Aelst (2014) proposed a modified SD estimator that adapts winsorization (Huber, 1981;Alqallaf et al, 2002) and a cellwise weighting scheme. Similar to the original SD estimator, however, computation is only feasible for small p. A recent approach by Agostinelli et al (2014) is capable of dealing with both rowwise and cellwise outliers.…”
Section: Introductionmentioning
confidence: 99%