2008
DOI: 10.1016/j.specom.2007.09.002

Multisensory processing for speech enhancement and magnitude-normalized spectra for speech modeling

Cited by 22 publications (10 citation statements)
References 22 publications
“…In order to use the corresponding target speaker as the input speaker, i.e., optimization of reconstructed target spectra and/or performing target-to-source conversion, the notations of x and y, in Eqs. (1)-(7), are swapped with each other. Though, the performance of VAE-based VC is noticeably insufficient because the conversion flow is not considered in the parameter optimization.…”
Section: Conventional VAE-based VC (mentioning)
confidence: 99%
“…The results have been evaluated for GH, LP, optimally modified log spectral amplitude (OM-LSA) [2] and an existing probabilistic approach (PA) [14]. Table 3 presents the LSD results for Gaussian noise with different SNR levels and interfering speech, obtained by using four different speech enhancement methods: GH, LP, OM-LSA and PA.…”
Section: Results (mentioning)
confidence: 99%
“…Subramanya et al. proposed a statistical feature mapping technique for achieving good noise suppression. Liu et al.…”
Section: Introduction (mentioning)
confidence: 99%