Interspeech 2021 2021
DOI: 10.21437/interspeech.2021-1410
|View full text |Cite
|
Sign up to set email alerts
|

Low-Delay Speech Enhancement Using Perceptually Motivated Target and Loss

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(5 citation statements)
references
References 16 publications
0
5
0
Order By: Relevance
“…CRN was first introduced to speech enhancement by [56,57]. It nested a recurrent neural network (RNN) module inside the CNN-based encoder-decoder structure [58]. The RNN module is capable of handling long-term contexts in a sequencebased manner [59], but often requires high-level features.…”
Section: Structure Of Dvenmentioning
confidence: 99%
See 2 more Smart Citations
“…CRN was first introduced to speech enhancement by [56,57]. It nested a recurrent neural network (RNN) module inside the CNN-based encoder-decoder structure [58]. The RNN module is capable of handling long-term contexts in a sequencebased manner [59], but often requires high-level features.…”
Section: Structure Of Dvenmentioning
confidence: 99%
“…The CNN module is able to extract high-level features but mainly focuses on local temporal-spectral patterns [60]. Combining their advantages, the CRN structure has been shown to be very effective for speech enhancement [58,[61][62][63]. Motivated by [58,61], we determined three convolution layers for the encoder, three transposed convolution layers for the decoder, and two LSTM layers between them.…”
Section: Structure Of Dvenmentioning
confidence: 99%
See 1 more Smart Citation
“…Most of the studies (i.e. [11]- [21]) provided short-time objective intelligibility (STOI [22], [23]) scores, while a few (i.e. [16], [24], [25]) presented extended STOI (ESTOI [26]) scores.…”
Section: Introductionmentioning
confidence: 99%
“…Most of the studies (i.e. [8]- [18]) provided short-time objective intelligibility (STOI [19], [20]) scores, while a few (i.e. [13], [21], [22]) presented extended STOI (ESTOI [23]) scores.…”
Section: Introductionmentioning
confidence: 99%