2004
DOI: 10.1023/b:dami.0000023676.72185.7c
|View full text |Cite
|
Sign up to set email alerts
|

On-Line Unsupervised Outlier Detection Using Finite Mixtures with Discounting Learning Algorithms

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
263
0

Year Published

2009
2009
2021
2021

Publication Types

Select...
6
3

Relationship

1
8

Authors

Journals

citations
Cited by 418 publications
(263 citation statements)
references
References 21 publications
0
263
0
Order By: Relevance
“…Data sets included are: a data generator Mulcross 4 [23] which is designed to evaluate anomaly detectors, and three other anomaly detection data sets from UCI repository [4]: http, Annthyroid and Dermatology. Previous usage can be found in [28,23,15]. Http is the largest subset from KDD CUP 99 network intrusion data [28]; attack instances are treated as anomalies.…”
Section: Performance On Data Sets Containing Only Clustered Anomaliesmentioning
confidence: 99%
See 1 more Smart Citation
“…Data sets included are: a data generator Mulcross 4 [23] which is designed to evaluate anomaly detectors, and three other anomaly detection data sets from UCI repository [4]: http, Annthyroid and Dermatology. Previous usage can be found in [28,23,15]. Http is the largest subset from KDD CUP 99 network intrusion data [28]; attack instances are treated as anomalies.…”
Section: Performance On Data Sets Containing Only Clustered Anomaliesmentioning
confidence: 99%
“…A publicly available example of clustered anomalies can be found in KDDCUP 1999 data set 1 , where bursts of attacks (clustered anomalies) can be observed in a subset known as http [28] as shown in Figure 1. Three bursts of attacks are clustered, first in the middle of the data stream; and two smaller ones appeared at the end of the stream.…”
Section: Introductionmentioning
confidence: 99%
“…This approach is used in [13], where the authors describe an algorithm called SmartSifter that uses this idea to calculate outlier scores for each instance. The outlier score of an instance is defined as the Hellinger distance between two probability distributions of available data: one built for the whole data set and the other one built for the whole data set without the observed instance.…”
Section: Related Workmentioning
confidence: 99%
“…In [7], numerous discordancy tests are discussed for different scenarios. In [9] [10], authors propose SmartSifter (SS), which is an on-line real-time outlier detection algorithm. The basic principle of SS is to use a probabilistic model (a finite mixture model) to represent the underlying distribution of a given data set.…”
Section: Related Workmentioning
confidence: 99%