2020 IEEE Conference on Big Data and Analytics (ICBDA) 2020
DOI: 10.1109/icbda50157.2020.9289798
|View full text |Cite
|
Sign up to set email alerts
|

Anomaly Detection with Machine Learning in the Presence of Extreme Value - A Review Paper

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(6 citation statements)
references
References 31 publications
0
6
0
Order By: Relevance
“…The data used for the current work is an assimilation of data recorded onboard 7000 TEU 6 post-panamax container ship and weather hindcast data obtained from one of the metocean data repositories. The onboard recorded data samples are obtained as 15-minute averaged values using an onboard installed energy management web application, called Marorka Online.…”
Section: Data Exploration and Processingmentioning
confidence: 99%
See 1 more Smart Citation
“…The data used for the current work is an assimilation of data recorded onboard 7000 TEU 6 post-panamax container ship and weather hindcast data obtained from one of the metocean data repositories. The onboard recorded data samples are obtained as 15-minute averaged values using an onboard installed energy management web application, called Marorka Online.…”
Section: Data Exploration and Processingmentioning
confidence: 99%
“…However, most of these techniques detect outliers by taking into account the distribution of the data in high dimensional variable (or feature) space, paying not much attention towards the correlation between the variables. Such a technique may cause more harm than good as it would result in detecting extreme values (like extreme weather observations) as well as rare event samples as outliers, which would result in poor predictions in extreme or rare conditions using the models calibrated on the cleaned datasets, as concluded by Suboh and Aziz [6]. Moreover, in case of an unbalanced dataset, the data samples present in the sparse regions of high dimensional variable space would, probably, also suffer the same fate as extreme or rare events, resulting in the loss of valuable information.…”
Section: Introductionmentioning
confidence: 99%
“…On the contrary, Statisticians developed various algorithms for anomaly detection, but most of the techniques only apply to univariate cases [27]. The process of determining anomaly is more complicated in multivariate datasets compared to univariate datasets.…”
Section: B Anomaly Detection In High-dimensional and Multivariate Datamentioning
confidence: 99%
“…If these data are misjudged as any kind of data, training neural network with valid data will greatly reduce the classification effect of neural network. Therefore, outlier elimination algorithm is used to delete these few points [11]. Concrete method is the first through the principal component analysis for data dimension reduction, the data for dimension reduction after clustering, clustering center here first, respectively defined as the average of the two types of data, clustering is completed, remove from the far point of clustering center and clustering results are inconsistent with the original label, of eliminating outliers is completed.…”
Section: Data Processing Modulementioning
confidence: 99%