Interspeech 2017 2017
DOI: 10.21437/interspeech.2017-170
|View full text |Cite
|
Sign up to set email alerts
|

Predicting Speech Intelligibility Using a Gammachirp Envelope Distortion Index Based on the Signal-to-Distortion Ratio

Abstract: A new intelligibility prediction measure, called "Gammachirp Envelope Distortion Index (GEDI)" is proposed for the evaluation of speech enhancement algorithms. This model calculates the signal-to-distortion ratio (SDR) in envelope responses SDR env derived from the gammachirp filterbank outputs of clean and enhanced speech, and is an extension of the speech based envelope power spectrum model (sEPSM) to improve prediction and usability. An evaluation was performed by comparing human subjective results and mode… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
2

Relationship

3
3

Authors

Journals

citations
Cited by 10 publications
(17 citation statements)
references
References 16 publications
0
17
0
Order By: Relevance
“…The dcGC-FB was also used in models for speech intelligibility prediction [9][10][11][12][13]. A new model referred to as GEDI (the gammachirp envelope distortion index) [10,11] predicted the intelligibility of speech sounds processed with non-linear enhancement algorithms better than other recent indexes like STOI, CSII, and HASPI [12,13].…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…The dcGC-FB was also used in models for speech intelligibility prediction [9][10][11][12][13]. A new model referred to as GEDI (the gammachirp envelope distortion index) [10,11] predicted the intelligibility of speech sounds processed with non-linear enhancement algorithms better than other recent indexes like STOI, CSII, and HASPI [12,13].…”
Section: Resultsmentioning
confidence: 99%
“…We begin by defining the relationship between the ratio, f rat , and the total stimulus level at the output of the pGC, P gcp , as shown in Eq. (10). The slope of the ratio, f ð1Þ…”
Section: Compression In the Cgc And Its Inversementioning
confidence: 99%
See 2 more Smart Citations
“…The STOI is intended to assess the intelligibility of speech processed via an ideal time-frequency segregation (ITFS). It has been reported, however, that the STOI was not successful at predicting the intelligibility of speech sounds enhanced by via Wiener filtering [3] and a recent DNN-based enhancement algorithm [4].…”
Section: Introductionmentioning
confidence: 99%