Interspeech 2012 2012
DOI: 10.21437/interspeech.2012-197
|View full text |Cite
|
Sign up to set email alerts
|

Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
35
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 73 publications
(35 citation statements)
references
References 8 publications
0
35
0
Order By: Relevance
“…To combat environmental degradation, a variety of speech enhancement algorithms have been proposed (see [2] for an overview). There are two main approaches to speech enhancement: 1) the speech signal can be modified prior to degradation (e.g., optimal energy redistribution [4] and dynamic range compression [5]), or 2) the speech signal can be modified after degradation has been introduced (e.g., Wiener filters [6]). The former type of algorithm is referred to as a pre-processing algorithm and the latter as a post-processing algorithm.…”
Section: Introductionmentioning
confidence: 99%
“…To combat environmental degradation, a variety of speech enhancement algorithms have been proposed (see [2] for an overview). There are two main approaches to speech enhancement: 1) the speech signal can be modified prior to degradation (e.g., optimal energy redistribution [4] and dynamic range compression [5]), or 2) the speech signal can be modified after degradation has been introduced (e.g., Wiener filters [6]). The former type of algorithm is referred to as a pre-processing algorithm and the latter as a post-processing algorithm.…”
Section: Introductionmentioning
confidence: 99%
“…Moreover, we introduce ŝ, the signal example that is enhanced by reference modification algorithms such as SSDRC [3], in the D training process. The loss function is thus extended to Equation (4).…”
Section: Model Description and Training Processmentioning
confidence: 99%
“…The 810 English sentences (90 sentences × 9 SNRs) were used for testing only. To prepare the enhanced signal example ŝ introduced in Section 3.2, we selected three reference algorithms: (1) OptSII [14], a linear filter to maximize the Speech Intelligibility Index (SII), (2) OptMI [15], a linear filter to optimally redistribute energy based on mutual information criterion, and (3) SSDRC [3], a method to integrate spectral shaping and dynamic range compression. Each training sentence was randomly processed by one of these three algorithms to obtain its enhanced example.…”
Section: Model Description and Training Processmentioning
confidence: 99%
See 1 more Smart Citation
“…In the context of Lombard speech, there have been a number of studies on improving speech intelligibility. Spectral shaping and dynamic range compression, induced statically or adaptively (by optimizing an objective measure of intelligibility), have proven highly effective under various noise conditions [19,20]. We investigate the intelligiblity-enhancing properties of neural-adapted Lombard speech by comparing it to a post-processing strategy based on an intelligibilityenhancing algorithm.…”
Section: Introductionmentioning
confidence: 99%