2022
DOI: 10.1109/taslp.2022.3190725

Improved CEM for Speech Harmonic Enhancement in Single Channel Noise Suppression

Abstract: The periodic nature of voiced speech is often exploited to restore speech harmonics and to increase interharmonic noise suppression. In particular, a recent paper proposed to do this by manipulating the speech harmonic frequencies in the cepstral domain. The manipulations were carried out on the cepstrum of the excitation signal, obtained by the source-filter decomposition of speech. This method was termed Cepstral Excitation Manipulation (CEM). In this contribution we further analyse this method, point out its…
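The cepstral view of the excitation signal mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's CEM method or its improved variant. It assumes a plain real cepstrum and a simple low-quefrency lifter as the source-filter split, and every name and default in it (`split_envelope_excitation`, `pitch_from_excitation`, `n_lifter`, `f_min`, `f_max`) is hypothetical, chosen only for illustration.

```python
import numpy as np


def split_envelope_excitation(frame, n_lifter=30):
    """Split the real cepstrum of one frame into envelope and excitation parts.

    Assumption: low quefrencies (< n_lifter) approximate the spectral
    envelope, and the remainder approximates the excitation.
    """
    windowed = frame * np.hanning(len(frame))
    # Log-magnitude spectrum; the small constant avoids log(0).
    log_mag = np.log(np.abs(np.fft.fft(windowed)) + 1e-12)
    # Real cepstrum: inverse DFT of the log-magnitude spectrum.
    cepstrum = np.fft.ifft(log_mag).real

    # Symmetric low-quefrency lifter (the real cepstrum is symmetric).
    lifter = np.zeros(len(frame))
    lifter[:n_lifter] = 1.0
    lifter[-(n_lifter - 1):] = 1.0

    envelope_cep = cepstrum * lifter
    excitation_cep = cepstrum - envelope_cep
    return envelope_cep, excitation_cep


def pitch_from_excitation(excitation_cep, fs, f_min=60.0, f_max=400.0):
    """Estimate the fundamental frequency from the largest excitation-cepstrum peak."""
    q_lo = int(fs / f_max)  # shortest pitch period considered, in samples
    q_hi = int(fs / f_min)  # longest pitch period considered, in samples
    peak = q_lo + np.argmax(excitation_cep[q_lo:q_hi + 1])
    return fs / peak


# Synthetic voiced-like frame: a pulse train with an 80-sample period
# (200 Hz at a 16 kHz sampling rate).
fs = 16000
frame = np.zeros(1024)
frame[::80] = 1.0

envelope_cep, excitation_cep = split_envelope_excitation(frame)
f0_estimate = pitch_from_excitation(excitation_cep, fs)
```

In this simplified picture, the periodicity of voiced speech shows up as a peak in the excitation cepstrum at the pitch period, which is the kind of structure a cepstral-domain manipulation of the harmonics would act on. The lifter length and the pitch search range are illustrative choices and would need tuning for real speech.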

Cited by 4 publications (14 citation statements)
References 19 publications
“…Then, according to the source-filter model, the enhanced signal is decomposed into the excitation signal and the envelope, and each component can be enhanced individually. The enhancement of the speech excitation signal has been discussed in References [4, 5, 23], showing that the idealised excitation signal brings the benefit of recovering the weak or lost harmonics in the initial speech estimate.…”
Section: Speech Enhancement Framework (mentioning)
confidence: 99%
“…While the excitation signal can be modeled by straightforward mathematical equations due to its periodic nature in the voiced frames with the largest energy [4, 5, 23], data-driven methods are more common in the estimation of the speech envelopes as in References [10, 11, 12, 13]. If the underlying clean-speech envelope can be accurately estimated from the distorted or noisy signal envelope, it should improve the final speech estimate.…”
Section: Speech Enhancement Framework (mentioning)
confidence: 99%