ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
DOI: 10.1109/icassp39728.2021.9414364
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Multitrack Mixing With A Differentiable Mixing Console Of Neural Audio Effects

Abstract: Applications of deep learning to automatic multitrack mixing are largely unexplored. This is partly due to the limited available data, coupled with the fact that such data is relatively unstructured and variable. To address these challenges, we propose a domain-inspired model with a strong inductive bias for the mixing task. We achieve this with the application of pre-trained sub-networks and weight sharing, as well as with a sum/difference stereo loss function. The proposed model can be trained with a limited… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 20 publications
(17 citation statements)
references
References 14 publications
0
17
0
Order By: Relevance
“…Following this convention, we designed the Mastering Effects Manipulator as the most basic mastering chain by implementing parametric equalizer (EQ) for tonal control, multiband stereo imager for spatial adjustment, and maximizer (or compressor) for voluminous control. For the gain controller, EQ, and maximizer, we follow the implementation to pymixconsole 1 -the collection of audio effects modules presented at [10], and implement the multiband stereo imager that manipulates stereo width in each band using Linkwitz-Riley crossover filter [11].…”
Section: Mastering Effects Manipulatormentioning
confidence: 99%
See 2 more Smart Citations
“…Following this convention, we designed the Mastering Effects Manipulator as the most basic mastering chain by implementing parametric equalizer (EQ) for tonal control, multiband stereo imager for spatial adjustment, and maximizer (or compressor) for voluminous control. For the gain controller, EQ, and maximizer, we follow the implementation to pymixconsole 1 -the collection of audio effects modules presented at [10], and implement the multiband stereo imager that manipulates stereo width in each band using Linkwitz-Riley crossover filter [11].…”
Section: Mastering Effects Manipulatormentioning
confidence: 99%
“…For LMSS, we utilize the original implementation of [18] with FFT sizes of (4096, 2048, 1024, 512) and apply it to both stereo-channeled and mid/side-channeled outputs. Computing mid/side-channeled outputs with multi-scale spec-tral objective has been proposed as the stereo loss function in [10], which is to separately calculate summation and subtraction of left and right channels with LMSS so that the phase-related information can also be addressed. The total objective function for the Mastering Cloner is as follow:…”
Section: Mastering Clonermentioning
confidence: 99%
See 1 more Smart Citation
“…However, this approach requires manual implementation and often modification of DSP, imparting high engineering cost, potentially limiting its application. To work around this, two alternative methods have been proposed including neural proxies (NP) of audio effects [16] and numerical gradient approximation schemes [17]. However, given that these approaches were proposed and evaluated in different tasks, it is difficult to fully understand their relative performance.…”
Section: Introductionmentioning
confidence: 99%
“…unprocessed) source tracks are required. Given the recent advances in automatic mixing [4,5] and music source separation [6], a system could be developed that facilitates the adjustment of a stereo mixture to the user's taste and preferences similar to [7]. However, today's source separation systems are commonly trained on data that is based on music stems (e.g., MUSDB18 [8]).…”
Section: Introductionmentioning
confidence: 99%