2009 IEEE International Conference on Acoustics, Speech and Signal Processing 2009
DOI: 10.1109/icassp.2009.4959924
|View full text |Cite
|
Sign up to set email alerts
|

Audio classification from time-frequency texture

Abstract: Time-frequency representations of audio signals often resemble texture images. This paper derives a simple audio classification algorithm based on treating sound spectrograms as texture images. The algorithm is inspired by an earlier visual classification scheme particularly efficient at classifying textures. While solely based on time-frequency texture features, the algorithm achieves surprisingly good performance in musical instrument classification experiments.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
41
0

Year Published

2012
2012
2022
2022

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 64 publications
(41 citation statements)
references
References 13 publications
0
41
0
Order By: Relevance
“…For example, in [17], a texture-of-texture approach is used, which recursively applies filters to spectrogram and MFCC images to capture certain shapes, such as horizontal and vertical lines in the image, can classify music into one of three genres. Another approach in [19] uses local image patches, which are randomly selected during training, to classify musical instruments. A feature is then extracted for classification based on the minimum matching energy of each patch over the testing spectrogram.…”
Section: Related Work a Literature Reviewmentioning
confidence: 99%
“…For example, in [17], a texture-of-texture approach is used, which recursively applies filters to spectrogram and MFCC images to capture certain shapes, such as horizontal and vertical lines in the image, can classify music into one of three genres. Another approach in [19] uses local image patches, which are randomly selected during training, to classify musical instruments. A feature is then extracted for classification based on the minimum matching energy of each patch over the testing spectrogram.…”
Section: Related Work a Literature Reviewmentioning
confidence: 99%
“…The Short-Time Fourier Transform (STFT) was used to calculate the spectrogram ‫ݏ‬ሺ‫ݔ‬ǡ ‫ݕ‬ሻ [12], and the frames were taken to be 256-point frames with 192-point overlap.…”
Section: B Environmental Sound Spectrogram Calculationmentioning
confidence: 99%
“…The time-frequency representation (TFR) contains the complete information of an audio signal in both spectral and temporal domain [7]. The digital audio stream is divided into segments of fixed length for spectrogram generation using TFR.…”
Section: Time-frequency Representationmentioning
confidence: 99%
“…Little attentions have been given to audio classification in the visual domain. The time-frequency based (spectrogram) features can be used in audio discrimination in the analogous way of image classification [6] [7].…”
Section: Introductionmentioning
confidence: 99%