2019
DOI: 10.1109/tit.2019.2917669
|View full text |Cite
|
Sign up to set email alerts
|

Data Discovery and Anomaly Detection Using Atypicality: Theory

Abstract: A central question in the era of 'big data' is what to do with the enormous amount of information. One possibility is to characterize it through statistics, e.g., averages, or classify it using machine learning, in order to understand the general structure of the overall data. The perspective in this paper is the opposite, namely that most of the value in the information in some applications is in the parts that deviate from the average, that are unusual, atypical. We define what we mean by 'atypical' in an ax… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
42
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2

Relationship

3
3

Authors

Journals

citations
Cited by 16 publications
(42 citation statements)
references
References 58 publications
0
42
0
Order By: Relevance
“…The issue is that there are an astonishing large number of such trees; for example for there are 676 such trees. Instead of choosing the best, we can use the idea of the CTW [ 1 , 54 , 55 ] and weigh in each node: Suppose after passing a signal of an internal node S through low-pass and high-pass filters and downsampler, and are produced in the children nodes of S . The weighted probability of in the internal node S will be which is a good coding distribution for both a memoryless source and a source with memory [ 54 , 55 ].…”
Section: Scalar Signal Processing Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…The issue is that there are an astonishing large number of such trees; for example for there are 676 such trees. Instead of choosing the best, we can use the idea of the CTW [ 1 , 54 , 55 ] and weigh in each node: Suppose after passing a signal of an internal node S through low-pass and high-pass filters and downsampler, and are produced in the children nodes of S . The weighted probability of in the internal node S will be which is a good coding distribution for both a memoryless source and a source with memory [ 54 , 55 ].…”
Section: Scalar Signal Processing Methodsmentioning
confidence: 99%
“…We therefore need universal methods. In the paper [ 1 ] we developed a methodology, atypicality, that can be used to discover such data. The basic idea is that if some data can be encoded with a shorter codelength in itself, i.e., with a universal source coder, rather than using the optimum coder for typical data, then it is atypical.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…The papers [19], [20] show that atypicality has many desirable theoretical properties and that it works experimentally for sequences. Specifically for anomaly detection, the paper [20] shows that atypicality is (asymptotically) optimum for finite state machine (FSM) We will say that two FSM are distinct if they have no identical classes.…”
Section: Anomaly Detectionmentioning
confidence: 99%