Optimal subband Kalman filter for normal and oesophageal speech enhancement

Ishaq, Rizwan; Zapirain, Begoña García

doi:10.3233/bme-141183

Cited by 2 publications

(1 citation statement)

References 29 publications

“…Statistical conversion from ES to normal speech has also improved intelligibility, but requires more ES data [8]. Some other not so common approaches are based on Kalman filtering [9,10,11,12], and modulation filtering enhancement [13,14].…”

Section: Introduction (mentioning)

confidence: 99%

Vowel Enhancement in Early Stage Spanish Esophageal Speech Using Natural Glottal Flow Pulse and Vocal Tract Frequency Warping

Ishaq, Gowda, Alku et al. 2015

Proceedings of SLPAT 2015: 6th Workshop on Speech and Language Processing for Assistive Technologies

Self Cite

This paper presents an enhancement system for early stage Spanish Esophageal Speech (ES) vowels. The system decomposes the input ES into a neoglottal waveform and vocal tract filter components using Iterative Adaptive Inverse Filtering (IAIF). The neoglottal waveform is further decomposed into fundamental frequency (F0), Harmonic to Noise Ratio (HNR), and neoglottal source spectrum. The enhanced neoglottal source signal is constructed using a natural glottal flow pulse computed from real speech. The F0 and HNR are replaced with natural speech F0 and HNR. The vocal tract formant frequencies (spectral peaks) and bandwidths are smoothed, the formants are shifted downward using a second-order frequency warping polynomial, and the bandwidths are increased to bring them closer to those of natural speech. The system is evaluated using subjective listening tests on the Spanish ES vowels /a/, /e/, /i/, /o/, /u/. The Mean Opinion Score (MOS) shows significant improvement in the overall quality (naturalness and intelligibility) of the vowels.
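The abstract does not give the form of the warping polynomial, so the following is only a minimal sketch of what a downward second-order frequency warp could look like. The function name `warp_frequency`, the parameter `alpha`, the 4 kHz Nyquist frequency, and the formant values are all assumptions chosen for illustration, not values from the paper.

```python
def warp_frequency(f_hz, nyquist_hz, alpha):
    """Map a frequency through a hypothetical second-order warping polynomial.

    w(f) = f + alpha * f * (f - nyquist) / nyquist

    For 0 <= alpha < 1 the mapping is monotonic, keeps 0 Hz and the Nyquist
    frequency fixed, and moves every frequency in between downward.
    """
    if not 0.0 <= alpha < 1.0:
        raise ValueError("alpha must lie in [0, 1) to keep the warp monotonic")
    return f_hz + alpha * f_hz * (f_hz - nyquist_hz) / nyquist_hz


# Illustrative formant frequencies in Hz (assumed, not measured data).
formants = [850.0, 1610.0, 2850.0]
warped = [warp_frequency(f, 4000.0, 0.2) for f in formants]
```

With this form, a single parameter controls the size of the shift, and the largest displacement falls at half the Nyquist frequency.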

Enhancement of Spanish Oesophageal Speech vowels using coherent subband modulator Kalman filtering

Ishaq, Zapirain 2016

THC

This paper proposes an Oesophageal Speech (OES) enhancement method based on Kalman filtering. The Kalman filter is applied to the modulators of OES frequency subbands instead of the fullband signal. The OES frequency subbands are decomposed into modulator and carrier components using coherent demodulation. In comparison with fullband Kalman filtering and pole stabilization, the proposed technique shows better results. The system performance is evaluated objectively and subjectively using the Harmonic to Noise Ratio (HNR) and Mean Opinion Score (MOS), respectively. The results show that Kalman filtering of subband modulators is robust and efficient, improving the HNR by 4 to 5 dB for all Spanish vowels.
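As a rough illustration of the idea of filtering a subband modulator rather than the fullband signal, the sketch below runs a scalar Kalman filter over a noisy, slowly varying envelope. It assumes a random-walk state model, known noise variances, and a real-valued synthetic modulator; the paper's actual state model, the coherent demodulation step, and the subband decomposition are not reproduced here, and the name `kalman_smooth_envelope` is hypothetical.

```python
import math
import random


def kalman_smooth_envelope(observations, process_var, measurement_var):
    """Scalar Kalman filter with an assumed random-walk state model."""
    estimate = observations[0]      # initialise from the first sample
    error_var = measurement_var     # initial uncertainty of that estimate
    filtered = [estimate]
    for z in observations[1:]:
        # Predict: the state is assumed unchanged, uncertainty grows.
        error_var += process_var
        # Update: blend prediction and observation using the Kalman gain.
        gain = error_var / (error_var + measurement_var)
        estimate += gain * (z - estimate)
        error_var *= 1.0 - gain
        filtered.append(estimate)
    return filtered


def mean_squared_error(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


# Synthetic stand-in for one subband modulator (assumed, not real OES data).
random.seed(0)
clean = [1.0 + 0.5 * math.sin(2.0 * math.pi * n / 200.0) for n in range(400)]
noisy = [c + random.gauss(0.0, 0.2) for c in clean]
smoothed = kalman_smooth_envelope(noisy, process_var=1e-3, measurement_var=0.04)
```

In a full subband system, a filter of this kind would presumably run independently on each subband's modulator before the modulators are recombined with their carriers.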
