2021
DOI: 10.1109/mmul.2020.3046491
|View full text |Cite
|
Sign up to set email alerts
|

ClaviNet: Generate Music With Different Musical Styles

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 3 publications
0
2
0
Order By: Relevance
“…The literature [12] analyzed the method of selecting the aesthetic value of music singing based on the perspective of new media, and the new media technology combined with online media and mobile media can realize the reform of music teaching and complete the classroom arrangement of singing as well as let students master the scientific way of vocal music. Literature [13] proposed continuous style embedding through the general formulation of variational self-encoder, compared two different methods of z integration into VAE, combined with a deep learning model to control the training of the dataset for generating musical styles and better music samples using a baseline model of discrete style labels. In the literature [14], a system of vocal aerodynamics was used in a study to determine the relationship between the vertical phase difference and the vocal gate efficiency of musical theater singers.…”
Section: Literature Reviewmentioning
confidence: 99%
“…The literature [12] analyzed the method of selecting the aesthetic value of music singing based on the perspective of new media, and the new media technology combined with online media and mobile media can realize the reform of music teaching and complete the classroom arrangement of singing as well as let students master the scientific way of vocal music. Literature [13] proposed continuous style embedding through the general formulation of variational self-encoder, compared two different methods of z integration into VAE, combined with a deep learning model to control the training of the dataset for generating musical styles and better music samples using a baseline model of discrete style labels. In the literature [14], a system of vocal aerodynamics was used in a study to determine the relationship between the vertical phase difference and the vocal gate efficiency of musical theater singers.…”
Section: Literature Reviewmentioning
confidence: 99%
“…In total, our vocabulary for the piano contains 730 unique tokens. 8 The songs in our dataset have ∼95 bars on average, which translate to 5,249 tokens per song on average using the piano representation. Accordingly, a 512-token sequence employed in model training (i.e., x 1:T ) contains about nine bars on average.…”
Section: B Token Representationmentioning
confidence: 99%