2020
DOI: 10.48550/arxiv.2011.06801
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Comprehensive Survey on Deep Music Generation: Multi-level Representations, Algorithms, Evaluations, and Future Directions

Shulei Ji,
Jing Luo,
Xinyu Yang

Abstract: The utilization of deep learning techniques in generating various contents (such as image, text, etc.) has become a trend. Especially music, the topic of this paper, has attracted widespread attention of countless researchers.The whole process of producing music can be divided into three stages, corresponding to the three levels of music generation: score generation produces scores, performance generation adds performance characteristics to the scores, and audio generation converts scores with performance char… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
38
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 33 publications
(38 citation statements)
references
References 160 publications
(298 reference statements)
0
38
0
Order By: Relevance
“…In music generation, sequence models with symbolic music representation have exhibited considerable success [12,19]. In particular, the combination of the Transformer model [25] and a token representation of music has been revealed to generate coherent and structural music [11,14]. However, the music representations used in prior studies were note-level (MIDI-equivalent); hence, their expressions were limited compared to those of musical scores.…”
Section: (B)mentioning
confidence: 99%
“…In music generation, sequence models with symbolic music representation have exhibited considerable success [12,19]. In particular, the combination of the Transformer model [25] and a token representation of music has been revealed to generate coherent and structural music [11,14]. However, the music representations used in prior studies were note-level (MIDI-equivalent); hence, their expressions were limited compared to those of musical scores.…”
Section: (B)mentioning
confidence: 99%
“…In recent years, deep neural networks (DNNs) have obtained significant improvement in varieties of artificial intelligence areas such as image recognition [1], neural language translation [2], automatic speech recognition [3], and automatic music generation [4]. Automatic music generation has two mainstream music generation approaches include audio music generation [5,6,7] and symbolic music generation [8,9,10].…”
Section: Introductionmentioning
confidence: 99%
“…a 4/4 musical piece. Because the musical note distribution is dominated by the holding state [4], which negatively affects the diversity of the generated notes, cosine distance is also designed as part of loss functions to prevent EMN from remembering the holding state instead of musical notes to enrich the diversity of musical notes.…”
Section: Introductionmentioning
confidence: 99%
“…Research on automatic symbolic music generation or style transfer has seen significant progress in recent years [5]- [11]. Among such efforts, the adoption of the Transformer decoderbased neural network [12] as the backbone generative model has become popular [13]- [24].…”
Section: Introductionmentioning
confidence: 99%