2014
DOI: 10.5120/17026-7318
|View full text |Cite
|
Sign up to set email alerts
|

Comparative Analysis of Outlier Detection Techniques

Abstract: Data Mining simply refers to the extraction of very interesting patterns of the data from the massive data sets. Outlier detection is one of the important aspects of data mining which actually finds out the observations that are deviating from the common expected behavior. Outlier detection and analysis is sometimes known as outlier mining. In this paper, we have tried to provide the broad and a comprehensive literature survey of outliers and outlier detection techniques under one roof, so as to explain the ri… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
25
0
2

Year Published

2018
2018
2023
2023

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 19 publications
(28 citation statements)
references
References 32 publications
1
25
0
2
Order By: Relevance
“…Outlier samples can be error-bound (for example, data entry point, measurement error, experimental error, sampling errors) or have no error, in which case they are called natural outlier. In other words, natural outliers are actually samples that do not make any errors, but their distance from the rest of the samples is considerably large [ 34 ]. There are different ways to identify outlier proteins.…”
Section: Resultsmentioning
confidence: 99%
“…Outlier samples can be error-bound (for example, data entry point, measurement error, experimental error, sampling errors) or have no error, in which case they are called natural outlier. In other words, natural outliers are actually samples that do not make any errors, but their distance from the rest of the samples is considerably large [ 34 ]. There are different ways to identify outlier proteins.…”
Section: Resultsmentioning
confidence: 99%
“…For this reason, it has to be necessary to detect all outliers and to remove all of them from the data set. The methods used to determine the data that are outliers; are separated as statistical, parametric, non-parametric, distance-based, clustering-based and neural network [23]. In this study, the z-Score method which is one of the parametric methods was used.…”
Section: Outlier Rejectionmentioning
confidence: 99%
“…Fiducial cross sections as function of the jet multiplicity and eight differential variables test perturbative QCD under various conditions. Measuring W + and W − production separately, in addition allows to probe valence quark PDFs in a range of the momentum fraction x of the proton which is complementary to inclusive cross section measurements 13 .…”
Section: W Boson Production In Association With Jetsmentioning
confidence: 99%