2017
DOI: 10.1109/tnnls.2016.2526063
|View full text |Cite
|
Sign up to set email alerts
|

A Semisupervised Approach to the Detection and Characterization of Outliers in Categorical Data

Abstract: Abstract-In this paper we introduce a new approach of semi-supervised anomaly detection that deals with categorical data. Given a training set of instances (all belonging to the normal class), we analyze the relationships among features for the extraction of a discriminative characterization of the anomalous instances. Our key idea is to build a model characterizing the features of the normal instances and then use a set of distancebased techniques for the discrimination between the normal and the anomalous in… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0
1

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 41 publications
(19 citation statements)
references
References 29 publications
0
18
0
1
Order By: Relevance
“…To apply the labeled data to detect anomalies, some semisupervised methods [16]- [21] have been proposed. However, these methods focus on learning the behavior patterns of normal data, which require a large amount of labeled normal data for training.…”
Section: B Deep Anomaly Detectionmentioning
confidence: 99%
See 1 more Smart Citation
“…To apply the labeled data to detect anomalies, some semisupervised methods [16]- [21] have been proposed. However, these methods focus on learning the behavior patterns of normal data, which require a large amount of labeled normal data for training.…”
Section: B Deep Anomaly Detectionmentioning
confidence: 99%
“…If the information contained in these labeled data can be effectively applied, the accuracy of anomaly detection can be well improved. Therefore, some so-called semisupervised anomaly detection methods [16]- [21] using labeled data have been proposed. However, these methods can only use a large amount of labeled normal data for training, which generates extremely high labeling costs.…”
Section: Introductionmentioning
confidence: 99%
“…Depending on the availability of class labels, there are supervised, semi-supervised, and unsupervised outlier detection methods [8]. Outlier detection methods can also be grouped into different categories [9] like statistical methods, proximity based methods, classification based methods and clustering based methods, depending on the assumptions made on outliers versus normal data.…”
Section: Fig1 Objects Outside the Ellipse Are Outliers And Objectsmentioning
confidence: 99%
“…Li et al [20] proposed a data-analysis method for modeling and predicting the daily electricity consumption in buildings, which can also be used to forecast and detect of abnormal energy use. Ienco et al [21] analyzed the relationship among features to extract a discriminative characterization for anomalous instances. Fiore et al [22] used Discriminative Restricted Boltzmann Machine to combine the expressive power of generative models and infer knowledge from incomplete training data.…”
Section: Related Workmentioning
confidence: 99%