2014
DOI: 10.1587/transinf.e97.d.1411
|View full text |Cite
|
Sign up to set email alerts
|

Noise-Robust Voice Conversion Based on Sparse Spectral Mapping Using Non-negative Matrix Factorization

Abstract: SUMMARYThis paper presents a voice conversion (VC) technique for noisy environments based on a sparse representation of speech. Sparse representation-based VC using Non-negative matrix factorization (NMF) is employed for noise-added spectral conversion between different speakers. In our previous exemplar-based VC method, source exemplars and target exemplars are extracted from parallel training data, having the same texts uttered by the source and target speakers. The input source signal is represented using t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2015
2015
2019
2019

Publication Types

Select...
4
3
1

Relationship

3
5

Authors

Journals

citations
Cited by 13 publications
(8 citation statements)
references
References 19 publications
(27 reference statements)
0
8
0
Order By: Relevance
“…In order to retain the quality of converted voices in a noisy environment, noise robustness is required. In our previous study [46], a noise-robust NMF-based VC was proposed, where the performance was improved by 25% compared with the GMM-based method. As the currently proposed method is based on NMF-based VC, it will be easy to apply the noise-robust conversion.…”
Section: Resultsmentioning
confidence: 99%
“…In order to retain the quality of converted voices in a noisy environment, noise robustness is required. In our previous study [46], a noise-robust NMF-based VC was proposed, where the performance was improved by 25% compared with the GMM-based method. As the currently proposed method is based on NMF-based VC, it will be easy to apply the noise-robust conversion.…”
Section: Resultsmentioning
confidence: 99%
“…Wu et al proposed a method for NMF-based VC to reduce the computational cost [35]. In [4], we also proposed a frame-work that reduces computational time for NMF-based VC. We will combine these methods and investigate the optimal number of bases for better performance.…”
Section: Resultsmentioning
confidence: 99%
“…Wu et al [35] applied a spectrum compression factor to NMF-based VC to improve conversion quality. Second, our NMF-based VC method is noise robust [4]. The noise exemplars, which are extracted from the before-and after-utterance sections in the observed signal, are used as the noise dictionary, and the VC process is combined with NMF-based noise reduction.…”
Section: Related Workmentioning
confidence: 99%
“…The proposed method requires higher computation times than the GMM-based method. In [25], we proposed a framework that reduces computational time for NMF-based VC. In future work, we will investigate the optimal number of bases and evaluate performance under other noise conditions.…”
Section: Discussionmentioning
confidence: 99%