In this paper we propose a template matching algorithm to address tempo tracking problem in MP3 domain. The algorithm is based on MP3 Window-Switching Pattern (WSP) only. This means that no frequency analysis is performed by the program itself. Because the WSP is structured coherently with the drums line it is possible to compare this pattern with a simple metronome template. Experimental results are presented for a range of different musical styles, including rock, jazz, and popular songs with a variety of BPM and time signature. A part of the experimentation is dedicated to analyze the train set proposed for MIREX 2006. A computational cost analysis is presented too.
State-of-the-art MIR issues are presented and discussed both from the symbolic and audio points of view. As for the symbolic aspects, different approaches are presented in order to provide an overview of the different available solutions for particular MIR tasks. This section ends with an overview of MX, the IEEE standard XML language specifically designed to support interchange between musical notation, performance, analysis, and retrieval applications. As for the audio level, first we focus on blind tasks like beat and tempo tracking, pitch tracking and automatic recognition of musical instruments. Then we present algorithms that work both on compressed and uncompressed data. We analyze the relationships between MIR and feature extraction presenting examples of possible applications. Finally we focus on automatic music synchronization and we introduce a new audio player that supports the MX logic layer and allows to play both score and audio coherently.
Nowadays more and more audio contents are stored in compressed formats. Especially MP3 music has become very popular with the availability of powerful computation and wide bandwidth connectivity. So that, this chapter will be devoted to present techniques and algorithms, dealing with compressed audio, aimed at content analysis. Since content analysis in compressed domain is an innovative field of applications, the literature review will be extended to methods that extract music content from MP3, even if the algorithms are not focused on music information retrieval. In this chapter, the authors focus on a number of different algorithms dealing with common tasks of the MIR field such as tempo induction, tempo tracking, and automatic music synchronization. They will present an overview of the MusicXML, and IEEE1599 language to represent score and synchronization results, because they have decided to use those formats to represent the score in their synchronization algorithm. The chapter will end showing applications, conclusions, and future works in the field of direct content analysis in compressed domain.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.