2021
DOI: 10.1016/j.ins.2021.03.014
|View full text |Cite
|
Sign up to set email alerts
|

Non-local musical statistics as guides for audio-to-score piano transcription

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 16 publications
(4 citation statements)
references
References 25 publications
0
4
0
Order By: Relevance
“…the automata constructions are performed on-the-fly during best-search for efficiency reasons. One advantage of this swVPA approach is the global view provided by the stack during transcription, as opposed to other HMM-based approaches [22].…”
Section: Discussionmentioning
confidence: 99%
“…the automata constructions are performed on-the-fly during best-search for efficiency reasons. One advantage of this swVPA approach is the global view provided by the stack during transcription, as opposed to other HMM-based approaches [22].…”
Section: Discussionmentioning
confidence: 99%
“…We note that in related studies on rhythm transcription, researchers also apply HMM-based methods to take an input signal and estimate the metrical positions of all musical notes. In particular, similar first-order Markov assumptions and tempo transition probability distributions are adopted [38], [39], [40]. Despite that local tempo and local tempo changes are parameterized in these models, parameters are determined globally (e.g., based on a dataset) without the knowledge of "local periodicity" (which is explicitly extracted based on small local windows) proposed in this work.…”
Section: B Ppts With Global Assumptions For Tempomentioning
confidence: 99%
“…Owing to the recent development of automatic music transcription [23][24][25], accurate score information can be extracted from polyphonic music signals. Since the proposed method uses the interpretable synthesis parameters, we can utilize the available score information for the initialization of fr,t and lr,t.…”
Section: Initialization Of F0 and Loudness Using Score Informationmentioning
confidence: 99%