2020
DOI: 10.1142/s0219649220400134
|View full text |Cite
|
Sign up to set email alerts
|

Outlier Detection in High Dimensional Data

Abstract: High-dimensional data poses unique challenges in outlier detection process. Most of the existing algorithms fail to properly address the issues stemming from a large number of features. In particular, outlier detection algorithms perform poorly on data set of small size with a large number of features. In this paper, we propose a novel outlier detection algorithm based on principal component analysis and kernel density estimation. The proposed method is designed to address the challenges of dealing with high-d… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
4
1

Relationship

6
4

Authors

Journals

citations
Cited by 30 publications
(15 citation statements)
references
References 30 publications
0
15
0
Order By: Relevance
“…In this paper, we employ a kernel density-based method to identify the outliers in a dataset. Kernel density estimation (KDE) is a well-established statistical method that is widely used in different applications [ 41 ]. It is a nonparametric technique for estimating the underlying distribution of the sample data using a kernel function.…”
Section: Methodsmentioning
confidence: 99%
“…In this paper, we employ a kernel density-based method to identify the outliers in a dataset. Kernel density estimation (KDE) is a well-established statistical method that is widely used in different applications [ 41 ]. It is a nonparametric technique for estimating the underlying distribution of the sample data using a kernel function.…”
Section: Methodsmentioning
confidence: 99%
“…KDE is a well-known nonparametric density estimator used in number of applications [7], [9]. The KDE-based sampling approach is based on estimating the underlying distribution of the minority data.…”
Section: Methodology a Sampling Methodsmentioning
confidence: 99%
“…A common approach to address class imbalance is to balance the data through resampling [5]. Alternatively, outlier detection methods can be used in place of the standard classification algorithms [6].…”
Section: Literaturementioning
confidence: 99%