2016
DOI: 10.1145/2926717
|View full text |Cite
|
Sign up to set email alerts
|

Tempo Driven Audio-to-Score Alignment Using Spectral Decomposition and Online Dynamic Time Warping

Abstract: In this article, we present an online score following framework designed to deal with automatic accompaniment. The proposed framework is based on spectral factorization and online Dynamic Time Warping (DTW) and has two separated stages: preprocessing and alignment. In the first one, we convert the score into a reference audio signal using a MIDI synthesizer software and we analyze the provided information in order to obtain the spectral patterns (i.e., basis functions) associated to each score unit. In this wo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 11 publications
(20 citation statements)
references
References 15 publications
0
20
0
Order By: Relevance
“…In the comparison, four configurations of our method are presented. GT denotes the variant of our proposed method in which the perfect annotation score is used to guide the factorization stage, ScoreFree denotes the variant in which no alignment stage is used, and, therefore, the gains matrix A p , j ( t ) is initialized with random values, and Offline and Online make reference to variants whose alignment stages have been implemented following the Offline and Online approaches described in the work of Rodriguez‐Serrano et al Furthermore, we are also going to contrast our proposal with Oracle version, which uses the individual sources to establish the best separation that can be obtained using the proposed softmask reconstruction strategy. It sets an upper bound of all the configurations of the proposed method.…”
Section: Experiments and Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…In the comparison, four configurations of our method are presented. GT denotes the variant of our proposed method in which the perfect annotation score is used to guide the factorization stage, ScoreFree denotes the variant in which no alignment stage is used, and, therefore, the gains matrix A p , j ( t ) is initialized with random values, and Offline and Online make reference to variants whose alignment stages have been implemented following the Offline and Online approaches described in the work of Rodriguez‐Serrano et al Furthermore, we are also going to contrast our proposal with Oracle version, which uses the individual sources to establish the best separation that can be obtained using the proposed softmask reconstruction strategy. It sets an upper bound of all the configurations of the proposed method.…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…Finally, at the DTW block , the optimum path across matrix D ( τ , t ) is computed using DTW algorithm to provide the alignment between the score and performance times. More information about the alignment described earlier can be found in the works of Carabias‐Orti et al and Rodriguez‐Serrano et al…”
Section: Proposed Frameworkmentioning
confidence: 99%
See 1 more Smart Citation
“…In fact, classical offline systems rely on cost measures between events in the score and in the performance. Two methods well known in speech recognition have been extensively used in the literature: statistical approaches (e.g., hidden Markov model (HMM)) [22][23][24][25][26] and dynamic time warping (DTW) [19,[27][28][29].…”
Section: Introductionmentioning
confidence: 99%
“…Although several online audio-to-score approaches have been developed in the literature [22,28,29], only the works in [6,22,30] combined score alignment with SS in an online fashion. In [22] and in its extension [6], the alignment is performed using a hidden Markov process model, where each audio frame is associated with a 2-D state of score position and tempo.…”
Section: Introductionmentioning
confidence: 99%