2021
DOI: 10.31219/osf.io/ren6d
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Using Umap to Inspect Audio Data for Unsupervised Anomaly Detection Under Domain-Shift Conditions

Abstract: The goal of \ac{UAD} is to detect anomalous signals under the condition that only non-anomalous ({\it normal}) data is available beforehand. In UAD under Domain-Shift Conditions (UAD-S), data is further exposed to contextual changes that are usually unknown beforehand. Motivated by the difficulties encountered in the \acs{UAD-S} task presented at the 2021 edition of the \ac{DCASE} challenge\footnote{DCASE website: \url{http://dcase.community}}, we visually inspect \acp{UMAP} for log-STFT, log-mel and pretraine… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 16 publications
0
2
0
Order By: Relevance
“…Furthermore, autoencoders can localize anomalies in the input space by visualizing an element-wise reconstruction error as done in [19], [21]. However, training ASD models by using an auxiliary task usually enhances their performance [46]. Even for IM-based models, performance can be significantly improved when utilizing meta information such as machine types.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Furthermore, autoencoders can localize anomalies in the input space by visualizing an element-wise reconstruction error as done in [19], [21]. However, training ASD models by using an auxiliary task usually enhances their performance [46]. Even for IM-based models, performance can be significantly improved when utilizing meta information such as machine types.…”
Section: Introductionmentioning
confidence: 99%
“…This has been visualized using local interpretable model-agnostic explanations (LIME) [49] applied to sounds (SLIME) [50]. Furthermore, uniform manifold approximation and projection (UMAP) [51] has been used to visualize representations of the data such as stacked consecutive frames of log magnitude spectrograms, log-mel magnitude spectrograms, or openL3 embeddings [46].…”
Section: Introductionmentioning
confidence: 99%