2005
DOI: 10.1109/tmm.2004.840597
|View full text |Cite
|
Sign up to set email alerts
|

Audio thumbnailing of popular music using chroma-based representations

Abstract: Abstract-With the growing prevalence of large databases of multimedia content, methods for facilitating rapid browsing of such databases or the results of a database search are becoming increasingly important. However, these methods are necessarily media dependent. We present a system for producing short, representative samples (or "audio thumbnails") of selections of popular music. The system searches for structural redundancy within a given song with the aim of identifying something like a chorus or refrain.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
125
0
2

Year Published

2006
2006
2021
2021

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 195 publications
(130 citation statements)
references
References 14 publications
0
125
0
2
Order By: Relevance
“…The decomposition is realized by a suitable multirate filter bank consisting of elliptic filters [13]. This representation of audio signals can then be used as a basis for deriving various audio features of various characteristics [13,14], such as chroma pitch, chroma log pitch, and chroma energy normalized statics [15]. Figure 1 shows the waveform of an audio recording and its corresponding pitch features in various noise environments.…”
Section: Pitch-based Audio Featuresmentioning
confidence: 99%
“…The decomposition is realized by a suitable multirate filter bank consisting of elliptic filters [13]. This representation of audio signals can then be used as a basis for deriving various audio features of various characteristics [13,14], such as chroma pitch, chroma log pitch, and chroma energy normalized statics [15]. Figure 1 shows the waveform of an audio recording and its corresponding pitch features in various noise environments.…”
Section: Pitch-based Audio Featuresmentioning
confidence: 99%
“…Another application is that of music thumbnailing [17], which seeks to automatically identify a representative excerpt from a song. Utilizing the analysis described in this section, the thumbnail that best represents the identified motif (typically a good thumbnail since it is the most often repeated pattern found in the song) simply corresponds to the largest activation in H. If a longer thumbnail is required, then L or the parameters of α wτ can be scaled up to identify longer patterns.…”
Section: Riff Identificationmentioning
confidence: 99%
“…This property has been exploited for tasks as diverse as visualization, rhythmic analysis, automatic summarization and thumbnailing, chorus detection, annotation, synchronization and long-term segmentation [16]- [20]. With a few exceptions [17], [21], [22], the emphasis of this research has been on locating repetitions rather than on extracting of characteristic, repetitive patterns. The utility of extracting such patterns is illustrated by previous research on detecting motif occurrences across a collection [23] and cover-song retrieval based on feature sub-sequences [24].…”
Section: Introductionmentioning
confidence: 99%
“…Chroma features are widely used in applications such as cover song detection, transcription, and recommender systems (see, e.g. [7][8][9]). Most methods for chroma estimation begin with some pitch estimation, which then maps into its respective chroma.…”
Section: Introductionmentioning
confidence: 99%