2020
DOI: 10.1016/j.apacoust.2020.107511
|View full text |Cite
|
Sign up to set email alerts
|

FLGCNN: A novel fully convolutional neural network for end-to-end monaural speech enhancement with utterance-based objective functions

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 18 publications
(7 citation statements)
references
References 20 publications
0
7
0
Order By: Relevance
“…whereS cm r = |S cm |e jθ X andS cm i = |S cm |e jθ X . Both CME-Net and CSR-Net take the similar network topology as [13], which includes the gated convolutional encoder, decoder and stacked temporal convolution modules (dubbed TCMs) [14]. The encoder is utilized to extract the spectral patterns of the spectrum while the decoder is to reconstruct the spectrum.…”
Section: Two-stage Networkmentioning
confidence: 99%
“…whereS cm r = |S cm |e jθ X andS cm i = |S cm |e jθ X . Both CME-Net and CSR-Net take the similar network topology as [13], which includes the gated convolutional encoder, decoder and stacked temporal convolution modules (dubbed TCMs) [14]. The encoder is utilized to extract the spectral patterns of the spectrum while the decoder is to reconstruct the spectrum.…”
Section: Two-stage Networkmentioning
confidence: 99%
“…Compared to the existing models, the proposed method achieves higher SDR, PESQ, CORR, ESTOI, STOI, and SNR, as well as lower RMSE values. Moreover, the proposed model overcomes the drawbacks such as reduction in speech intelligibility [ 43 ], lower PESQ [ 40 ], lower robustness [ 37 ], not being suitable for complex noise environments [ 37 ], lower speech quality [ 10 ], and low SNR [ 45 ] [ 29 ]. However, this proposed method lacks at some noise sources and it did not determine the spectral magnitude and spectral phase estimation.…”
Section: Resultsmentioning
confidence: 99%
“…In 2020, Tan et al [ 45 ] introduced a Fully Convolutional Neural Network (FCNN) “to achieve end-to-end speech enhancement. The encoder and decoder, as well as an extra Convolutional-Based Short-Time Fourier Transform (CSTFT) layer and CISTFT layer, were applied to simulate forward as well as inverse STFT operations, respectively.…”
Section: Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation
“…Both CME-Net and CSR-Net take the similar network topology as [13], which includes the gated convolutional encoder, decoder and stacked temporal convolution modules (dubbed TCMs) [14]. The encoder is utilized to extract the spectral patterns of the spectrum while the decoder is to reconstruct the spectrum.…”
Section: Two-stage Networkmentioning
confidence: 99%