Proceedings of the International Workshop on Workshop on Multimedia Information Retrieval 2007
DOI: 10.1145/1290082.1290105
|View full text |Cite
|
Sign up to set email alerts
|

Tempo induction algorithm in MP3 compressed domain

Abstract: In this paper we propose a template matching algorithm to address tempo tracking problem in MP3 domain. The algorithm is based on MP3 Window-Switching Pattern (WSP) only. This means that no frequency analysis is performed by the program itself. Because the WSP is structured coherently with the drums line it is possible to compare this pattern with a simple metronome template. Experimental results are presented for a range of different musical styles, including rock, jazz, and popular songs with a variety of BP… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2010
2010
2014
2014

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 15 publications
0
1
0
Order By: Relevance
“…It is well known that Zernike moment has been widely used in many image-related research fields such as image recognition [11], image watermarking [12], human face recognition [13], and image analysis [14] due to its prominent property of strong robustness and rotation, scale, and translation (RST) invariance. So far, various compressed domain audio features including scale factors [15,16], MP3 window-switching pattern [17,18], basic MDCT coefficients and derived spectral energy, energy variation, duration of energy peaks, amplitude envelope, spectrum centroid, spectrum spread, spectrum flux, roll-off, RMS, rhythmic content like beat histogram [19][20][21][22][23][24] have been used in different applications such as retrieval, segmentation, genre classification, speech/ music discrimination, summarization, singer identification, watermarking, and beat tracing/tempo induction. However, in spite of the extensive use in various imagerelated research fields for years, to the authors' knowledge, Zernike moment has not yet been applied to music information retrieval.…”
Section: Introductionmentioning
confidence: 99%
“…It is well known that Zernike moment has been widely used in many image-related research fields such as image recognition [11], image watermarking [12], human face recognition [13], and image analysis [14] due to its prominent property of strong robustness and rotation, scale, and translation (RST) invariance. So far, various compressed domain audio features including scale factors [15,16], MP3 window-switching pattern [17,18], basic MDCT coefficients and derived spectral energy, energy variation, duration of energy peaks, amplitude envelope, spectrum centroid, spectrum spread, spectrum flux, roll-off, RMS, rhythmic content like beat histogram [19][20][21][22][23][24] have been used in different applications such as retrieval, segmentation, genre classification, speech/ music discrimination, summarization, singer identification, watermarking, and beat tracing/tempo induction. However, in spite of the extensive use in various imagerelated research fields for years, to the authors' knowledge, Zernike moment has not yet been applied to music information retrieval.…”
Section: Introductionmentioning
confidence: 99%