“…Recently, the research community has witnessed a shift in methodology from conventional signal processing methods [6,7] to data-driven enhancement approaches, particularly those based on deep learning paradigms [8,9,3,10,11]. Beside discriminative modeling with typical deep network variants, such as deep neural networks (DNNs) [8], convolutional neural networks (CNNs) [9,10], and recurrent neural networks (RNNs) [11,3], generative modeling with GANs [12] have been shown to hold promise for speech enhancement [13,14,15]. Furthermore, the study in [15] indicates that generative modeling with GANs may result in fewer artefacts than discriminative methods.…”