2021
DOI: 10.48550/arxiv.2103.02147
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Reverb Conversion of Mixed Vocal Tracks Using an End-to-end Convolutional Deep Neural Network

Abstract: Reverb plays a critical role in music production, where it provides listeners with spatial realization, timbre, and texture of the music. Yet, it is challenging to reproduce the musical reverb of a reference music track even by skilled engineers. In response, we propose an end-to-end system capable of switching the musical reverb factor of two different mixed vocal tracks. This method enables us to apply the reverb of the reference track to the source track to which the effect is desired. Further, our model ca… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(3 citation statements)
references
References 8 publications
0
3
0
Order By: Relevance
“…For the task of acoustic matching for audio post-production, Sarroff and Michaels [11] proposed an RNN trained to predict the parameters of an algorithmic reverb given a reverberant recording. Koo et al [12] extended this with a U-Net trained to directly convert singing voice recordings to match a target signal, forgoing the need to estimate the parameters of an algorithmic reverb. Nevertheless, while these results are convincing, both methods rely upon training data generated with algorithmic reverbs, which may limit generalization for matching real environments in augmented reality.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…For the task of acoustic matching for audio post-production, Sarroff and Michaels [11] proposed an RNN trained to predict the parameters of an algorithmic reverb given a reverberant recording. Koo et al [12] extended this with a U-Net trained to directly convert singing voice recordings to match a target signal, forgoing the need to estimate the parameters of an algorithmic reverb. Nevertheless, while these results are convincing, both methods rely upon training data generated with algorithmic reverbs, which may limit generalization for matching real environments in augmented reality.…”
Section: Related Workmentioning
confidence: 99%
“…While these parameters provide useful information about room acoustics, they can be limiting in cases where auralization of the RIR is required. This is especially the case with augmented and virtual reality (AR and VR), and is also the case with reverb matching procedures for audio post-production [9][10][11][12]. In these cases, accurately matching the characteristics of the room acoustics is generally required, which often extends beyond T60 and DRR.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation