2018
DOI: 10.48550/arxiv.1812.07159
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Autoencoder Based Architecture For Fast & Real Time Audio Style Transfer

Abstract: Recently, there has been great interest in the field of audio style transfer, where a stylized audio is generated by imposing the style of a reference audio on the content of a target audio. We improve on the current approaches which use neural networks to extract the content and the style of the audio signal and propose a new autoencoder based architecture for the task. This network generates a stylized audio for a content audio in a single forward pass. The proposed network architecture proves to be advantag… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2019
2019
2019
2019

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 2 publications
0
1
0
Order By: Relevance
“…In the case of style transfer problems for audio data, style and content are ambiguous [18]. Consequently, style transfer approaches for audio have largely resulted in efforts that match the timbres of the source and target domains (for example, opera sounds have been matched to sound like cats licking milk) [18,19]. In such cases, it is difficult to define what a good network output is supposed to sound like.…”
Section: Related Workmentioning
confidence: 99%
“…In the case of style transfer problems for audio data, style and content are ambiguous [18]. Consequently, style transfer approaches for audio have largely resulted in efforts that match the timbres of the source and target domains (for example, opera sounds have been matched to sound like cats licking milk) [18,19]. In such cases, it is difficult to define what a good network output is supposed to sound like.…”
Section: Related Workmentioning
confidence: 99%