Predicting Speech Intelligibility Using a Gammachirp Envelope Distortion Index Based on the Signal-to-Distortion Ratio

Yamamoto, Kazumasa; Matsui, Toshie; Araki, Shoko; Kinoshita, Keisuke; Nakatani, Tomohiro

doi:10.21437/interspeech.2017-170

Cited by 10 publications

(17 citation statements)

References 16 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The dcGC-FB was also used in models for speech intelligibility prediction [9][10][11][12][13]. A new model referred to as GEDI (the gammachirp envelope distortion index) [10,11] predicted the intelligibility of speech sounds processed with non-linear enhancement algorithms better than other recent indexes like STOI, CSII, and HASPI [12,13].…”

Section: Resultsmentioning

confidence: 99%

“…We begin by defining the relationship between the ratio, f rat , and the total stimulus level at the output of the pGC, P gcp , as shown in Eq. (10). The slope of the ratio, f ð1Þ…”

Section: Compression In the Cgc And Its Inversementioning

confidence: 99%

“…This average level is used to determine the frequency ratio for the HP-AF, f rat , in Eq. (10) and, subsequently, the gain of the ''inverse'' HP-AF. The gain vector for the filterbank channels is used as the ''inverse'' excitation pattern.…”

Section: Frame-based Time-varying Filtermentioning

confidence: 99%

“…The GC architecture enables us to construct a hearing impairment simulator which allows normal hearing listeners to experience the difficulties of hearing impaired listeners [5,6]. The GC has also been used to model speaker size perception [7,8] and speech intelligibility [9][10][11][12][13].…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

The gammachirp auditory filter and its application to speech perception

Patterson

2020

Acoust. Sci. & Tech.

Self Cite

View full text Add to dashboard Cite

We review the gammachirp (GC) auditory filter and its use in speech perception research. The GC was originally developed to explain the asymmetric, auditory filter shapes derived in notchednoise (NN) masking studies, and the strongly compressive input-output function observed in the mammalian cochlea. This compressive GC was fitted to a very large collection of notched-noise (NN) masking thresholds measured with a wide range of stimulus levels and center frequencies. The fit showed how the GC auditory filter could explain NN masking throughout the domain of human hearing with a relatively small number of parameters, only one of which was level dependent. Subsequently, a dynamic, compressive GC filterbank (dcGC-FB) was developed to simulate timedomain cochlear processing. This dcGC-FB has been used to cancel the peripheral compression of normal hearing and thereby simulate the most common forms of hearing loss. This simulator allows normal hearing listeners to experience the difficulties of hearing impaired listeners. It has been used in training courses for speech-language-hearing therapists and psychoacoustic experiments. The dcGC-FB has also been used for modeling speaker size perception and predicting speech intelligibility with GEDI (the gammachirp envelope distortion index).

show abstract

Section: Resultsmentioning

confidence: 99%

“…We begin by defining the relationship between the ratio, f rat , and the total stimulus level at the output of the pGC, P gcp , as shown in Eq. (10). The slope of the ratio, f ð1Þ…”

Section: Compression In the Cgc And Its Inversementioning

confidence: 99%

Section: Frame-based Time-varying Filtermentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

The gammachirp auditory filter and its application to speech perception

Patterson

2020

Acoust. Sci. & Tech.

Self Cite

View full text Add to dashboard Cite

show abstract

“…The STOI is intended to assess the intelligibility of speech processed via an ideal time-frequency segregation (ITFS). It has been reported, however, that the STOI was not successful at predicting the intelligibility of speech sounds enhanced by via Wiener filtering [3] and a recent DNN-based enhancement algorithm [4].…”

Section: Introductionmentioning

confidence: 99%

Multi-resolution Gammachirp Envelope Distortion Index for Intelligibility Prediction of Noisy Speech

Yamamoto

Ohashi²,

Araki

et al. 2018

Interspeech 2018

Self Cite

View full text Add to dashboard Cite

A multi-resolution version of the gammachirp envelope distortion index (mr-GEDI) is proposed for the intelligibility prediction of noisy speech processed using speech enhancement algorithms. The proposed model calculates the short-time signal-todistortion ratio in the temporal envelope modulation extracted from the output of the gammachirp auditory filterbank. The predictions were compared with human subjective results for various signal-to-noise ratio conditions with pink and babble noise. The mr-GEDI predicts the intelligibility curves better than the hearing-aid speech perception index (HASPI).

show abstract

Optimal Near-End Speech Intelligibility Improvement Using CLPSO-Based Voice Transformation in Realistic Noisy Environments

Biswas

Nathwani

2022

Circuits Syst Signal Process

View full text Add to dashboard Cite

Predicting Speech Intelligibility Using a Gammachirp Envelope Distortion Index Based on the Signal-to-Distortion Ratio

Cited by 10 publications

References 16 publications

The gammachirp auditory filter and its application to speech perception

The gammachirp auditory filter and its application to speech perception

Multi-resolution Gammachirp Envelope Distortion Index for Intelligibility Prediction of Noisy Speech

Optimal Near-End Speech Intelligibility Improvement Using CLPSO-Based Voice Transformation in Realistic Noisy Environments

Contact Info

Product

Resources

About