Interspeech 2020 2020
DOI: 10.21437/interspeech.2020-2044
|View full text |Cite
|
Sign up to set email alerts
|

An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances

Abstract: In this paper, we propose a sub-utterance unit selection framework to remove acoustic segments in audio recordings that carry little information for acoustic scene classification (ASC). Our approach is built upon a universal set of acoustic segment units covering the overall acoustic scene space. First, those units are modeled with acoustic segment models (ASMs) used to tokenize acoustic scene utterances into sequences of acoustic segment units. Next, paralleling the idea of stop words in information retrieval… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 24 publications
0
1
0
Order By: Relevance
“…In recent years, we have witnessed a great progress in the acoustic scene classification (ASC) task, as demonstrated by the high participation in the IEEE Detection and Classification of Acoustic Scenes and Events (DCASE) challenges [1,2,3]. Top ASC systems use deep neural networks (DNNs), and the main ingredient of their success is the application of deep convolutional neural networks (CNNs) [4,5,6,7,8,9]. Further boost in ASC performance is obtained with the introduction of advanced deep learning techniques, such as attention mechanism [10,11,12], mix-up [13,14], Generative Adversial Network (GAN) and Variational Auto Encoder (VAE) based data augmentation [15,16], and deep feature learning [17,18,19,20].…”
Section: Introductionmentioning
confidence: 99%
“…In recent years, we have witnessed a great progress in the acoustic scene classification (ASC) task, as demonstrated by the high participation in the IEEE Detection and Classification of Acoustic Scenes and Events (DCASE) challenges [1,2,3]. Top ASC systems use deep neural networks (DNNs), and the main ingredient of their success is the application of deep convolutional neural networks (CNNs) [4,5,6,7,8,9]. Further boost in ASC performance is obtained with the introduction of advanced deep learning techniques, such as attention mechanism [10,11,12], mix-up [13,14], Generative Adversial Network (GAN) and Variational Auto Encoder (VAE) based data augmentation [15,16], and deep feature learning [17,18,19,20].…”
Section: Introductionmentioning
confidence: 99%