2015
DOI: 10.1186/s13636-015-0067-4
|View full text |Cite
|
Sign up to set email alerts
|

Multimodal voice conversion based on non-negative matrix factorization

Abstract: A multimodal voice conversion (VC) method for noisy environments is proposed. In our previous non-negative matrix factorization (NMF)-based VC method, source and target exemplars are extracted from parallel training data, in which the same texts are uttered by the source and target speakers. The input source signal is then decomposed into source exemplars, noise exemplars, and their weights. Then, the converted speech is constructed from the target exemplars and the weights related to the source exemplars. In … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2016
2016
2016
2016

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 29 publications
0
2
0
Order By: Relevance
“…The noise in the input signal is not only output with the converted signal, but may also degrade the conversion performance itself due to unexpected mapping of source features. In [16], the experimental result proved the effectiveness of visual features in the task of VC in noisy environments.…”
Section: Introductionmentioning
confidence: 77%
See 1 more Smart Citation
“…The noise in the input signal is not only output with the converted signal, but may also degrade the conversion performance itself due to unexpected mapping of source features. In [16], the experimental result proved the effectiveness of visual features in the task of VC in noisy environments.…”
Section: Introductionmentioning
confidence: 77%
“…In [16], we have proposed an exemplar-based multimodal Voice Conversion (VC) method. In the task of automatic speech recognition (ASR), one problem is that the recognition performance remarkably decreases under noisy environments, and it becomes a significant problem seeking to develop a practical use of ASR.…”
Section: Introductionmentioning
confidence: 99%