2024
DOI: 10.1609/aaai.v38i12.29294
|View full text |Cite
|
Sign up to set email alerts
|

Learning Temporal Resolution in Spectrogram for Audio Classification

Haohe Liu,
Xubo Liu,
Qiuqiang Kong
et al.

Abstract: The audio spectrogram is a time-frequency representation that has been widely used for audio classification. One of the key attributes of the audio spectrogram is the temporal resolution, which depends on the hop size used in the Short-Time Fourier Transform (STFT). Previous works generally assume the hop size should be a constant value (e.g., 10 ms). However, a fixed temporal resolution is not always optimal for different types of sound. The temporal resolution affects not only classification accuracy but als… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
references
References 42 publications
0
0
0
Order By: Relevance