2014
DOI: 10.1587/transinf.e97.d.1429
|View full text |Cite
|
Sign up to set email alerts
|

A Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Generation

Abstract: SUMMARYThis paper presents an electrolaryngeal (EL) speech enhancement method capable of significantly improving naturalness of EL speech while causing no degradation in its intelligibility. An electrolarynx is an external device that artificially generates excitation sounds to enable laryngectomees to produce EL speech. Although proficient laryngectomees can produce quite intelligible EL speech, it sounds very unnatural due to the mechanical excitation produced by the device. Moreover, the excitation sounds p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
17
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
5
1

Relationship

3
3

Authors

Journals

citations
Cited by 25 publications
(19 citation statements)
references
References 14 publications
(19 reference statements)
2
17
0
Order By: Relevance
“…11. As reported in [9], we confirmed that Batch is significantly improved compared with EL by predicting F 0 patterns based on statistical F 0 patterns. For our proposed methods RT and Forthcoming, we achieved that two proposed systems caused no degradation compared with Batch.…”
Section: Naturalness Of Predicted F 0 Patternssupporting
confidence: 87%
See 4 more Smart Citations
“…11. As reported in [9], we confirmed that Batch is significantly improved compared with EL by predicting F 0 patterns based on statistical F 0 patterns. For our proposed methods RT and Forthcoming, we achieved that two proposed systems caused no degradation compared with Batch.…”
Section: Naturalness Of Predicted F 0 Patternssupporting
confidence: 87%
“…We found that reducing the variability of F 0 patterns such as rapid movements, we achieved to train F 0 patterns with a smaller number of mixture components. Moreover, as reported in [9], we also confirmed that CF 0 brings better performance compared with the original F 0 because continuous sequence makes it possible to consider inter-frame correlation over an utterance. The proposed segmented CF 0 preserves such an improvement relatively well while minimizing degradation of the prediction accuracy.…”
Section: Best Number Of Mixture Componentssupporting
confidence: 87%
See 3 more Smart Citations