2011
DOI: 10.1109/jstsp.2011.2145356
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised Discovery of Temporal Structure in Music

Abstract: Abstract-We describe a data-driven algorithm for automatically identifying repeated patterns in music which analyzes a feature matrix using shift-invariant probabilistic latent component analysis. We utilize sparsity constraints to automatically identify the number of patterns and their lengths, parameters that would normally need to be fixed in advance, as well as to control the structure of the decomposition. The proposed analysis is applied to beat-synchronous chromagrams in order to concurrently extract re… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
12
0

Year Published

2013
2013
2025
2025

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 29 publications
(12 citation statements)
references
References 28 publications
0
12
0
Order By: Relevance
“…That dataset is composed of 176 songs and is traditionally used to evaluate such segmentation task [10,4,9,12]. We also evaluate against the Internet Archive part of the more recent SALAMI dataset [20], which contains 253 freely available songs.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…That dataset is composed of 176 songs and is traditionally used to evaluate such segmentation task [10,4,9,12]. We also evaluate against the Internet Archive part of the more recent SALAMI dataset [20], which contains 253 freely available songs.…”
Section: Discussionmentioning
confidence: 99%
“…We leave an exhaustive exploration of the parameters for future work due to the limitation of space, while still showing that we can obtain good results with this set of arguments. The results of the algorithm are compared against two other techniques that use matrix factorization for music segmentation: SI-PLCA [12] and a variant of our algorithm that uses classic NMF instead of C-NMF. The parameters used for SI-PLCA are the ones proposed for MIREX (see source code 4 ).…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Certain computational theories such as computational auditory scene analysis (CASA) also rely on such representations [3,8]. As an emerging approach for data analysis, NMF has been successfully used in such applications as music transcription, blind source separation and et al [9][10][11][12].…”
Section: Introductionmentioning
confidence: 99%
“…In [5], NMF is applied to a specific score matrix combining harmonic and timbral information. Moreover, sparse, convolutive NMF has been applied to spectrogram data in order to locate recurrent harmonic motifs as well as to infer high-level structure [6].…”
Section: Introductionmentioning
confidence: 99%