ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021
DOI: 10.1109/icassp39728.2021.9413732

Instrument Classification of Solo Sheet Music Images

Abstract: This paper studies instrument classification of solo sheet music. Whereas previous work has focused on instrument recognition in audio data, we instead approach the instrument classification problem using raw sheet music images. Our approach first converts the sheet music image into a sequence of musical words based on the bootleg score representation, and then treats the problem as a text classification task. We show that it is possible to significantly improve classifier performance by training a language mo…

Citations: cited by 1 publication (1 citation statement)
References: 8 publications
“…One which has taken the interest of researchers is the use of transformer-based language models for different musical-related tasks. For instance, previous work compared the performance of different transformer-based approaches for composer style classification [23] and instrument classification [16] by converting raw images of piano sheet music to text and treating these problems as text classification tasks. In [2], the XLNet language model is used to recognise emotion from lyrics, while in [11] GPT-2 is used to generate musical sequences in ABC notation.…”
Section: Transformer-based Approaches In Music (mentioning)
confidence: 99%