“…Many deep learning models, such as feed-forward DNNs (FDNNs) [11], [12], [13], convolutional neural networks (CNNs) [11], [14], [15], recurrent neural networks (RNNs) [16], [17], [18], [19], gated recurrent units (GRUs) [20], [21], and generative adversarial networks (GANs) [22], [23], [24], are used for SE. To learn the temporal dependencies of speech signals, FDNNs have been extended to RNNs.…”