2021
DOI: 10.1049/cit2.12065
|View full text |Cite
|
Sign up to set email alerts
|

A transformer generative adversarial network for multi‐track music generation

Abstract: This study proposes a new generation network based on transformers and guided by the music theory to produce high‐quality music work. In this study, the decoding block of the transformer is used to learn the internal information of single‐track music, and cross‐track transformers are used to learn the information amongst the tracks of different musical instruments. A reward network based on the music theory is proposed, which optimizes the global and local loss objective functions while training and discrimina… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
13
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8
2

Relationship

1
9

Authors

Journals

citations
Cited by 30 publications
(13 citation statements)
references
References 25 publications
0
13
0
Order By: Relevance
“…Its effectiveness in classification, approximation, and prediction has been demonstrated in various studies, including refs. [17, 19–40].…”
Section: Introductionmentioning
confidence: 99%
“…Its effectiveness in classification, approximation, and prediction has been demonstrated in various studies, including refs. [17, 19–40].…”
Section: Introductionmentioning
confidence: 99%
“…Deep learning (DL) represents a class of cutting-edge ML algorithms inspired by neural networks. The DL models have been able to achieve near-human accuracy levels in various types of classification and prediction tasks including images, text, speech, and video data [1][2][3]. Thus the demand to capitalise on the potential shown by the DL solutions continues to grow.…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, we propose a framework to efficiently and intelligently implement music generation and immersive sound field twinning for concerts in the metaverse, namely MetaMGC (Music Generation Framework for Concerts in Metaverse). It consists of three main parts: (1) a music generation part that enables improvised accompaniment of a virtual orchestra for metaverse concerts; (2) a digital audio twin part that enables virtual sound field reconstruction for metaverse concerts [6]; and (3) an audio rendering part that realizes the virtual soundstage production of the metaverse concert.…”
Section: Introductionmentioning
confidence: 99%