Proceedings of the 5th Workshop on Speech and Language Processing for Assistive Technologies 2014
DOI: 10.3115/v1/w14-1905
|View full text |Cite
|
Sign up to set email alerts
|

Individuality-preserving Voice Conversion for Articulation Disorders Using Dictionary Selective Non-negative Matrix Factorization

Abstract: We present in this paper a voice conversion (VC) method for a person with an articulation disorder resulting from athetoid cerebral palsy. The movements of such speakers are limited by their athetoid symptoms, and their consonants are often unstable or unclear, which makes it difficult for them to communicate. In this paper, exemplar-based spectral conversion using Nonnegative Matrix Factorization (NMF) is applied to a voice with an articulation disorder. In order to preserve the speaker's individuality, we us… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
5
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
3
3

Relationship

3
3

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 23 publications
0
5
0
Order By: Relevance
“…Over the years, voice conversion frameworks have mostly focused on spectral conversion between source and target speakers [8,21]. In the sense of the statistical parametric approaches, such as Gaussian mixture model (GMM) [59] and exemplar based on nonnegative matrix factorization [1,63], SVC showed a success in the linear transformation of the spectral information. Nonlinear transformation approaches, such as hidden Markov models (HMMs) [49], deep belief networks (DBNs) [45] and restricted Boltzmann machines (RBMs) [46], have been also shown to be effective in modeling the relationship between source-target features more accurately.…”
Section: Introductionmentioning
confidence: 99%
“…Over the years, voice conversion frameworks have mostly focused on spectral conversion between source and target speakers [8,21]. In the sense of the statistical parametric approaches, such as Gaussian mixture model (GMM) [59] and exemplar based on nonnegative matrix factorization [1,63], SVC showed a success in the linear transformation of the spectral information. Nonlinear transformation approaches, such as hidden Markov models (HMMs) [49], deep belief networks (DBNs) [45] and restricted Boltzmann machines (RBMs) [46], have been also shown to be effective in modeling the relationship between source-target features more accurately.…”
Section: Introductionmentioning
confidence: 99%
“…Voice conversion (VC) has been widely used in many speech processing tasks, such as speaking assistance [1], speech enhancement [2] and other applications [3], [4]. Therefore, the need for this type of technology in various fields has continued to propel related studies each year.…”
Section: Introductionmentioning
confidence: 99%
“…This technology can be widely applied in various application domains. For instances, emotion conversion [1], speaking assistance [2], and other applications [3] [4]. Therefore, the need for this type of technology in various fields has continued to propel related researches each year.…”
Section: Introductionmentioning
confidence: 99%
“…Among these approaches, a Gaussian Mixture Model (GMM) is widely used, and a number of improvements have been proposed [7] [8] for GMM-based voice conversion. Other VC methods, such as approaches based on non-negative matrix factorization (NMF) [9] [2] have also been proposed. The NMF and GMM methods are based on linear functions.…”
Section: Introductionmentioning
confidence: 99%