2016
DOI: 10.1109/taslp.2016.2593801

Piano Transcription in the Studio Using an Extensible Alternating Directions Framework

Abstract: Given a musical audio recording, the goal of automatic music transcription is to determine a score-like representation of the piece underlying the recording. Despite significant interest within the research community, several studies have reported on a "glass ceiling" effect, an apparent limit on the transcription accuracy that current methods seem incapable of overcoming. In this paper, we explore how much this effect can be mitigated by focusing on a specific instrument class and making use of additional inf…

Cited by 18 publications (36 citation statements)
References 52 publications
“…those close to the decision boundary. In [3], the corresponding threshold a_min was derived from user input as this threshold depends on the recording level. More precisely, the user is asked to provide an example of a note having the lowest intensity to be expected in a recording session (during the evaluation this was one value for the entire dataset and not specific to recordings).…”
Section: Thresholding Based On Glasberg-Moore Model
Citation type: mentioning
confidence: 99%
“…the activations for the onset part of each note pattern in P . The same representation was used in [3] for the final onset detection. We used LSTM networks in two different configurations.…”
Section: LSTM-based Decoding
Citation type: mentioning
confidence: 99%
“…In the supervised NMF, templates were usually formed by the isolated notes of the specific piano to be transcribed. Ewert employed spectro-temporal patterns to model the temporal evolution in NMF [31]. Cheng proposed a method to model the attack and decay of notes, and all the templates were trained by a Disklavier piano [32].…”
Section: Introduction
Citation type: mentioning
confidence: 99%