Proceedings of the 28th ACM International Conference on Multimedia 2020
DOI: 10.1145/3394171.3413848
|View full text |Cite
|
Sign up to set email alerts
|

Temporally Guided Music-to-Body-Movement Generation

Abstract: This paper presents a neural network model to generate virtual violinist's 3-D skeleton movements from music audio. Improved from the conventional recurrent neural network models for generating 2-D skeleton data in previous works, the proposed model incorporates an encoder-decoder architecture, as well as the selfattention mechanism to model the complicated dynamics in body movement sequences. To facilitate the optimization of self-attention model, beat tracking is applied to determine effective sizes and boun… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 30 publications
(7 citation statements)
references
References 22 publications
0
7
0
Order By: Relevance
“…Some researchers describe manufacturing capabilities as design and manufacturing capabilities, ascertained manufacturing capabilities, and actual manufacturing capabilities. According to manufacturing tasks, piano automation, equipment, and the relationship between roles, the model of the solution model of manufacturing capability from task expectation to demand deployment and the relationship model of piano automation composition from capability to role are established, and the piano automation composition hierarchical configuration model is proposed [22][23][24].…”
Section: Related Workmentioning
confidence: 99%
“…Some researchers describe manufacturing capabilities as design and manufacturing capabilities, ascertained manufacturing capabilities, and actual manufacturing capabilities. According to manufacturing tasks, piano automation, equipment, and the relationship between roles, the model of the solution model of manufacturing capability from task expectation to demand deployment and the relationship model of piano automation composition from capability to role are established, and the piano automation composition hierarchical configuration model is proposed [22][23][24].…”
Section: Related Workmentioning
confidence: 99%
“…To overcome this shortcoming and improve the pose dynamics estimation, the attention mechanism has been widely used by the existing generative methods. For instance, Kao et al developed a generativeattention-based network for synthesizing the movement of the skeleton of a violinist playing a particular piece of music [88]. This method first extracts the Mel-Frequency Cepstral Coefficients (MFCC) features of the audio.…”
Section: B Dance Choreography Generationmentioning
confidence: 99%
“…With the great success of deep neural networks, many researchers have addressed music-to-dance as a generation problem using learning-based techniques. Recent methods have employed auto encoderdecoders [41], LSTM [42][43][44][45][46], GANs [47,48], and transformers [49][50][51]. Even though some work [47,52] applies action units to further explore the correlations between pose and music, they still find it challenging to generate diverse, rhythmic, expressive dance motions.…”
Section: Music-driven Animationmentioning
confidence: 99%