2019 27th European Signal Processing Conference (EUSIPCO) 2019
DOI: 10.23919/eusipco.2019.8902941
|View full text |Cite
|
Sign up to set email alerts
|

Sounding Industry: Challenges and Datasets for Industrial Sound Analysis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 36 publications
(17 citation statements)
references
References 11 publications
0
17
0
Order By: Relevance
“…Many challenges arise when ASC models are deployed in smart city [90,91] or industrial sound analysis [92] scenarios. The first challenge is the model complexity, which is limited if data privacy concerns require the classification to be performed directly on mobile sensor devices.…”
Section: Real-world Deploymentmentioning
confidence: 99%
“…Many challenges arise when ASC models are deployed in smart city [90,91] or industrial sound analysis [92] scenarios. The first challenge is the model complexity, which is limited if data privacy concerns require the classification to be performed directly on mobile sensor devices.…”
Section: Real-world Deploymentmentioning
confidence: 99%
“…The 10 best performing submissions in 2020 applied Deep Learning (DL), treating the developement dataset as training data. Some used different forms of data augmentation and additions from ex-ternal datasets such as AudioSet [6] and Fraunhofer's IDMT-ISA-EE dataset [2]. For inference, most submissions directly applied the trained DL models, often via multi-task ensembles.…”
Section: Uad In Dcasementioning
confidence: 99%
“…For our data sources, we merged the Development and Additional Training datasets [12] from DCASE 2021, task 2. To illustrate the role of external datasets, we also incorporated the 10-second cut variant of Fraunhofer's IDMT-ISA-EE dataset [2], and a custom subset of AudioSet consisting of 10-second segments from ∼40k unique videos. All audio files were converted to mono 16kHz, and (−1, 1) normalization was applied.…”
Section: Umap For Dcase 2021mentioning
confidence: 99%
See 1 more Smart Citation
“…Finally, sound requires lesser computational resources as compared to an image or video [3], [4]. Thus, a SED system has great potential in many other domains besides speech recognition [5], [6] such as medical telemonitoring [7], surveillance [8], [9], equipment monitoring [10] [11] and wildlife monitoring [12], [13].…”
Section: Introductionmentioning
confidence: 99%