2019
DOI: 10.3390/app10010302
|View full text |Cite
|
Sign up to set email alerts
|

Flow Synthesizer: Universal Audio Synthesizer Control with Normalizing Flows

Abstract: The ubiquity of sound synthesizers has reshaped music production and even entirely defined new music genres. However, the increasing complexity and number of parameters in modern synthesizers make them harder to master. Hence, the development of methods allowing to easily create and explore with synthesizers is a crucial need.Here, we introduce a novel formulation of audio synthesizer control. We formalize it as finding an organized latent audio space that represents the capabilities of a synthesizer, while co… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
43
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 38 publications
(44 citation statements)
references
References 4 publications
1
43
0
Order By: Relevance
“…We used 4 speech corpora for developing our multispeaker expressive TTS system. The speech corpora are Lisa [12], a French female neutral corpus (approx. 3 hrs), Caroline [20], a French female expressive corpus (approx.…”
Section: Data Preparationmentioning
confidence: 99%
See 1 more Smart Citation
“…We used 4 speech corpora for developing our multispeaker expressive TTS system. The speech corpora are Lisa [12], a French female neutral corpus (approx. 3 hrs), Caroline [20], a French female expressive corpus (approx.…”
Section: Data Preparationmentioning
confidence: 99%
“…For fast and high-fidelity wavenet based speech synthesis, the authors in [11] proposed probability density distillation to fill in the bridge between trained Wavenet as teacher model and IAF as a student model. In [12], the authors proposed a universal audio synthesizer built using normalizing flows to learn the latent space representation for semantic control of a synthesizer by interpolation of latent variables. In this paper, IAF were used as normalizing flows to perform the variational inference for learning meaningful latent space representation .…”
Section: Introductionmentioning
confidence: 99%
“…For the SVM, tested parameter values for C and gamma were respectively [0.01, 0.1, 1, 10, 100, 1000] and [1, 0.1, 0.01, 0.001, 0.00001, 0.000001, 10]. For KNN, the tested N values were [1,2,3,4,5,6,7,8,9,10] and for the Random forest [1, 10, 100, 500,1000] trees were considered. In the case of the FuzzyRuLer, d was explored from 1 to half the number of features in the dataset and ratio with [0.9, 0.8, 0.7, 0.6, 0.5] values.…”
Section: Cross-validationmentioning
confidence: 99%
“…In [ 8 ], a methodology is presented that relates the spaces of parameters and audio capabilities of a synthesizer in such a way that the mapping relating those spaces is invertible, which encourages high-level interactions with the synth. The system allows intuitive audio-based preset exploration.…”
Section: Introductionmentioning
confidence: 99%
“…Similarly to effects processing, sound synthesis has been a core subject at DAFx conferences from the beginning. Flow Synthesizer: Universal Audio Synthesizer Control with Normalizing Flows by Philippe Esling, Naotake Masuda, Adrien Bardet, Romeo Despres, and Axel Chemla-Romeu-Santos [12] highlights the importance of affordable, intuitive control in today's complex sound synthesizers. By using Variational Auto-Encoders and Normalizing Flows, the authors invert the synthesizer's map from its controls to a "latent" audio space, this way inferring parameters, learning control, drawing smooth transitions, and defining presets.…”
Section: Sound Synthesis Composition and Sonificationmentioning
confidence: 99%