Two existing speech conversion algorithms were modified and used to enhance alaryngeal speech. The modifications were aimed at reducing spectral distortion (bandwidth increase) in a vector-quantization (VQ) based system and the spectral discontinuity in a linear multivariate regression (LMR) based system. Spectral distortion was compensated for by formant enhancement using chirp z-transform and cepstral weighting. Spectral discontinuity was alleviated using overlapping clusters during the construction of conversion mapping function. The modified VQ and LMR algorithms were used to enhance alaryngeal speech. Results of perceptual evaluation indicated that listeners generally preferred to listen to the alaryngeal speech samples enhanced by the modified conversions over original samples.
Index Terms-Speech enhancement, speech conversion, speech analysis and synthesis, vector quantization, linear multivariate regression