ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9053815
|View full text |Cite
|
Sign up to set email alerts
|

Towards Linking the Lakh and IMSLP Datasets

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(5 citation statements)
references
References 8 publications
0
5
0
Order By: Relevance
“…In this set of experiments, we test our method on incomplete queries, modelling a realistic scenario where one may not have the entire piece for a search, but a fragment of an unknown audio recording or an unlabelled page of sheet music, and wishes to identify its originating piece. A similar study on MIDI-to-score retrieval was recently described in [11].…”
Section: Experiments 3: Fragment-level Retrievalmentioning
confidence: 70%
See 1 more Smart Citation
“…In this set of experiments, we test our method on incomplete queries, modelling a realistic scenario where one may not have the entire piece for a search, but a fragment of an unknown audio recording or an unlabelled page of sheet music, and wishes to identify its originating piece. A similar study on MIDI-to-score retrieval was recently described in [11].…”
Section: Experiments 3: Fragment-level Retrievalmentioning
confidence: 70%
“…Then we measure the final MRR value and the average search time per query. We repeat this procedure 10 times for each subset size and use the average of the results, similarly to [11].…”
Section: Experiments 4: Scalabilitymentioning
confidence: 99%
“…The bootleg score features for audio-sheet retrieval are a slight variant of the features described above (Tsai, 2020). There are two differences between the audio and sheet music (alignment) bootleg score representations that prevent their use in a cross-modal hashing framework.…”
Section: Feature Extractionmentioning
confidence: 99%
“…The second step is to identify the sheet music in IMSLP that matches the audio query. Our approach combines ideas from three recent papers: (1) automatic piano transcription using the Onsets and Frames system (Hawthorne et al, 2018), (2) bootleg score features modified for large-scale MIDI-sheet image retrieval (Tsai, 2020), and (3) dynamic n-gram fingerprinting (Yang and Tsai, 2020), which has recently demonstrated sub-second image-image retrieval of IMSLP piano sheet music. While this part of our system simply adapts and combines existing techniques, we believe this is the first empirical study of audio-sheet image retrieval at the scale of IMSLP and can serve as a useful baseline for the research community.…”
Section: Retrievalmentioning
confidence: 99%
“…While much of the research involving sheet music images revolves around optical music recognition (OMR) [1], there are a lot of other interesting applications and problems that can be solved without requiring OMR as an initial preprocessing step. Some recent examples include piece identification based on a cell phone picture of a physical page of sheet music [2], audio-sheet image score following and alignment [3,4], and finding matches between the Lakh MIDI dataset and the International Music Score Library Project (IMSLP) database [5]. This article explores a composer style classification task based on raw sheet music images.…”
Section: Introductionmentioning
confidence: 99%