2017
DOI: 10.1109/tbme.2016.2644258
|View full text |Cite
|
Sign up to set email alerts
|

Joint Dictionary Learning-Based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery

Abstract: This paper focuses on machine learning based voice conversion (VC) techniques for improving the speech intelligibility of surgical patients who have had parts of their articulators removed. Because of the removal of parts of the articulator, a patient's speech may be distorted and difficult to understand. To overcome this problem, VC methods can be applied to convert the distorted speech such that it is clear and more intelligible. To design an effective VC method, two key points must be considered: 1) the amo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
31
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 45 publications
(31 citation statements)
references
References 34 publications
0
31
0
Order By: Relevance
“…The computational modeling is mechanized based on an effective training-classification paradigm. The experimental analysis performed clearly shows that the system can effectively recognize and classify the speech signal prime entities as compared to existing baselines [13]. There exist several applications which process lower operational features to synthesize speech signals.…”
Section: Related Workmentioning
confidence: 89%
“…The computational modeling is mechanized based on an effective training-classification paradigm. The experimental analysis performed clearly shows that the system can effectively recognize and classify the speech signal prime entities as compared to existing baselines [13]. There exist several applications which process lower operational features to synthesize speech signals.…”
Section: Related Workmentioning
confidence: 89%
“…This method used two types of features, multiple-frame lowresolution exemplars and single-frame high-resolution exemplars, to estimate the activation matrix simultaneously. Fu et al [14] introduced joint dictionary learning-based NMF (JD-NMF) used to improve patients' speech intelligibility after oral surgery. The JD-NMF was designed to learn the source and the target dictionaries simultaneously from the common activation matrix.…”
Section: Joint Nmf (J-nmf)mentioning
confidence: 99%
“…In another method, the matrices A and Z are learned from the training data by alternately updating one matrix while keeping the other matrix fixed. The size of the constructed dictionary using this method is significantly reduced relative to that in the exemplar-based NMF method, resulting in improved online conversion efficiency [3].…”
Section: Nmf-based Vcmentioning
confidence: 99%
“…The baseline system is the NMF-based voice conversion system using parallel data described in [3]. The dictionaries have r = 100 bases.…”
Section: Baseline Systemmentioning
confidence: 99%