2020
DOI: 10.3788/lop57.081006
|View full text |Cite
|
Sign up to set email alerts
|

Optical Music Recognition Method Combining Multi-Scale Residual Convolutional Neural Network and Bi-Directional Simple Recurrent Units

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…In order to solve the problem of low recognition accuracy of music score images, Huang Zhiqing et al [6] developed an end-to-end target detection model to recognize music score notes based on Darknet53 base network and feature fusion technology extracts the feature map of music score images using the Darknet53 deep neural network and then spliced the upper feature map of the neural network with the feature map to complete the feature fusion so that the notes have a clearer feature texture, and then recognized them. In order to solve the problem of long recognition time of music score images, Wu Qiong et al [7 ] improved CNN to residual CNN based on the C-BiLSTM model of convolutional neural circulation network and formed the residual convolutional neural circulation network RC-BiLSTM and recognized the optical music score based on the note properties of the music score. Liang Mingheng [8] uses the depth network to identify musical notes, accepts the entire musical score image as input, and outputs the time value and pitch of the notes.…”
Section: Related Workmentioning
confidence: 99%
“…In order to solve the problem of low recognition accuracy of music score images, Huang Zhiqing et al [6] developed an end-to-end target detection model to recognize music score notes based on Darknet53 base network and feature fusion technology extracts the feature map of music score images using the Darknet53 deep neural network and then spliced the upper feature map of the neural network with the feature map to complete the feature fusion so that the notes have a clearer feature texture, and then recognized them. In order to solve the problem of long recognition time of music score images, Wu Qiong et al [7 ] improved CNN to residual CNN based on the C-BiLSTM model of convolutional neural circulation network and formed the residual convolutional neural circulation network RC-BiLSTM and recognized the optical music score based on the note properties of the music score. Liang Mingheng [8] uses the depth network to identify musical notes, accepts the entire musical score image as input, and outputs the time value and pitch of the notes.…”
Section: Related Workmentioning
confidence: 99%
“…The end-to-end method [2,[4][5][6]11,[32][33][34] has become the mainstream of OMR for its simplicity of data preprocessing. The vast majority of the end-to-end methods is sequence recognition, which means that it directly converts the stave images into symbol sequences with the pitch and type of each note.…”
Section: Related Workmentioning
confidence: 99%
“…Based on the CRNN, Calvo-Zaragoza et al [2,3] proposed the CRNN-CTC by improving the network structure of CRNN and optimizing the mapping dictionary of musical symbol sequences, and achieved note type accuracy of 99.2% on printed dataset PrIMuS [3] and 96.6% on synthetically distorted dataset Camera-PrIMuS [11]. Qiong W et al [4] improved the CRNN-CTC by replacing the CNN with a multi-scale residual CNN and changing the BilSTM unit with BiSRU. It achieved an accuracy of 99.7% on the PrIMuS dataset.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…With the continuous development of the field of computer vision, deep learning technology has also made continuous progress, and there are more and more studies using deep learning technology to solve the problem of optical score recognition [5][6][7] . Therefore, this paper proposes a musical note image recognition method based on the convolutional neural network model, and constructs a single-input multiple-output network model to recognize the musical note image.…”
Section: Introductionmentioning
confidence: 99%