Interspeech 2020 2020
DOI: 10.21437/interspeech.2020-1826
|View full text |Cite
|
Sign up to set email alerts
|

Blind Speech Signal Quality Estimation for Speaker Verification Systems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
8
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
2
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(9 citation statements)
references
References 0 publications
1
8
0
Order By: Relevance
“…We modified the publicly available implementation of [16] to support uncertainty propagation and used the original implementation as a baseline 3 , further referred to as VBx.…”
Section: Bayesian Hmm With Uncertainty Propagationmentioning
confidence: 99%
See 1 more Smart Citation
“…We modified the publicly available implementation of [16] to support uncertainty propagation and used the original implementation as a baseline 3 , further referred to as VBx.…”
Section: Bayesian Hmm With Uncertainty Propagationmentioning
confidence: 99%
“…signal duration [1,2,3] may not be the most relevant characteristics in terms of recognition performance.…”
Section: Introductionmentioning
confidence: 99%
“…As an alternative method of separating speech and non-speech segments, we used the information about the acoustic characteristics of the signal from the microphone array of the AMI corpus. During evaluation, for each 0.5 s signal frame we estimated the SNR and RT60 parameters using the signal quality estimation (QE) model described in [9]. SNR and RT60 parameters were predicted for all 8 channels of the microphone array, so for each 0.5 s frame we obtained a vector of 16 values.…”
Section: Qe-vectors Systemmentioning
confidence: 99%
“…Recently, in addition to standard energy-based voice activity detectors, neural network-based VADs are gaining popularity, which allows one to obtain better resistance to noise conditions [8]. In this paper, we consider the use of the DNN-based VAD described in [9] for a diarization task applied to the AMI Meeting Corpus [10]. To deal with the inherent problems of multi-dialogue recordings such as speaker interruptions and the simultaneous utterances of multiple speakers, we examine options for fusing VAD with other speech analytic systems.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation