2017 IEEE International Conference on Data Science and Advanced Analytics (DSAA) 2017
DOI: 10.1109/dsaa.2017.35
|View full text |Cite
|
Sign up to set email alerts
|

SECODA: Segmentation- and Combination-Based Detection of Anomalies

Abstract: This study introduces SECODA, a novel generalpurpose unsupervised non-parametric anomaly detection algorithm for datasets containing continuous and categorical attributes. The method is guaranteed to identify cases with unique or sparse combinations of attribute values. Continuous attributes are discretized repeatedly in order to correctly determine the frequency of such value combinations. The concept of constellations, exponentially increasing weights and discretization cut points, as well as a pruning heuri… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
14
0

Year Published

2018
2018
2025
2025

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 8 publications
(14 citation statements)
references
References 42 publications
0
14
0
Order By: Relevance
“…On each repetition, the low-frequency data are identified. The outliers are extreme value data which have low frequency on each repetition [13].…”
Section: A Categorized Outlier Detection Methods For Mixed-attribute Datamentioning
confidence: 99%
See 1 more Smart Citation
“…On each repetition, the low-frequency data are identified. The outliers are extreme value data which have low frequency on each repetition [13].…”
Section: A Categorized Outlier Detection Methods For Mixed-attribute Datamentioning
confidence: 99%
“…The outlier value is determined by using the saliency degree, which is calculated from the relationships between data. [13] Interval Numerical data are transformed into categorical data using a specific interval length. The transformation is repeated using different interval lengths.…”
Section: A Categorized Outlier Detection Methods For Mixed-attribute Datamentioning
confidence: 99%
“…Researchers should at least aim to include the most important types, based on the domain or the problem being studied. See [1] for an example of an evaluation. Local vs. global anomalies.…”
Section: Evaluation Of Algorithmsmentioning
confidence: 99%
“…Anomalies are cases that are in some way unusual and do not appear to fit the general patterns present in the dataset [1,2,3]. Such cases are often also referred to as outliers, novelties or deviant observations [3,4].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation