2020
DOI: 10.48550/arxiv.2002.09607
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Multi-Representation Knowledge Distillation For Audio Classification

Abstract: As an important component of multimedia analysis tasks, audio classification aims to discriminate between different audio signal types and has received intensive attention due to its wide applications. Generally speaking, the raw signal can be transformed into various representations (such as Short Time Fourier Transform and Mel Frequency Cepstral Coefficients), and information implied in different representations can be complementary. Ensembling the models trained on different representations can greatly boos… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 43 publications
0
1
0
Order By: Relevance
“…Another model based on the combination of representations is the model proposed by Gao et al (2020), in which they combined three types of audio representations, which fed two models, one trained for acoustic scene classification and the other for general audio tagging. They use the DCASE 2018 Challenge dataset 2 , achieving a mAP@3 of 93.3% in the acoustic scene classification task and an accuracy of 72.48% in the acoustic scene classification task, outperforming the results of other state-of-the-art methods based on a single representation.…”
Section: State Of the Artmentioning
confidence: 99%
“…Another model based on the combination of representations is the model proposed by Gao et al (2020), in which they combined three types of audio representations, which fed two models, one trained for acoustic scene classification and the other for general audio tagging. They use the DCASE 2018 Challenge dataset 2 , achieving a mAP@3 of 93.3% in the acoustic scene classification task and an accuracy of 72.48% in the acoustic scene classification task, outperforming the results of other state-of-the-art methods based on a single representation.…”
Section: State Of the Artmentioning
confidence: 99%