Age-Related Voice Disguise and its Impact on Speaker Verification Accuracy

Hautamäki, Rosa González; Sahidullah, Md; Kinnunen, Tomi; Hautamäki, Ville

doi:10.21437/odyssey.2016-40

Cited by 2 publications

(7 citation statements)

References 7 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Voice disguise is a complex problem that has attracted interest from different research communities. Previous studies on the topic enable one to identify Our preliminary analyses of the effects of voice disguise on modern ASV 60 systems was reported in (González Hautamäki et al, 2016). The experiments indicated the vulnerability of our ASV systems in the presence of disguised voices when the speakers intended old and young voices.…”

Section: Accepted Manuscriptmentioning

confidence: 79%

“…The second element is to compare the performance of native and non-native listeners for its relevance in a forensic setting such as voice-lineups, in which the listeners may be unfamiliar with the speaker's language. Previous studies confirm that the reliability of non-native listeners decreases in speaker recognition tasks (Eriksson 100 et al, 2010;Köster et al, 1997) The dataset used for this study was collected by the authors and is the same that was used in our preliminary study (González Hautamäki et al, 2016). Our data consists of speech from 60 native Finnish speakers with 31 female and 110 29 male speakers.…”

Section: Accepted Manuscriptmentioning

confidence: 88%

“…In addition to the acoustic analysis, we designed a perceptual experiment to benchmark the performance of human speaker verification accuracy under 90 voice disguise. Our perceptual task includes two novel elements, first, a selection of speech sample pairs, or trials, using the results from the ASV systems implemented in our previous study (González Hautamäki et al, 2016). More specifically, we use the ASV system output to select easy, intermediate and difficult speaker pairs.…”

Section: Accepted Manuscriptmentioning

confidence: 99%

“…In the same context, our previous study (González Hautamäki et al, 2016) 190 evaluates the performance of six ASV systems. In terms of equal error rate (EER), the ASV systems' configuration performance was degraded with disguised voices.…”

mentioning

confidence: 99%

“…It therefore only considers the close-talking microphone speech, which has the highest recording quality. Interested readers are pointed to our earlier study (González Hautamäki et al, 2016) in which we analyzed the effect of smart-phone recordings on the accuracy of automatic speaker recogni-400 tion. The recording set-up is illustrated in Figure 3.…”

mentioning

confidence: 99%

See 4 more Smart Citations

Acoustical and perceptual study of voice disguise by age modification in speaker verification

Hautamäki

Sahidullah

Hautamäki

et al. 2017

Speech Communication

Self Cite

View full text Add to dashboard Cite

The task of speaker recognition is feasible when the speakers are cooperative or wish to be recognized. While modern automatic speaker verification (ASV) systems and some listeners are good at recognizing speakers from modal, unmodified speech, the task becomes notoriously difficult in situations of deliberate voice disguise when the speaker aims at masking his or her identity. We approach voice disguise from the perspective of acoustical and perceptual analysis using a self-collected corpus of 60 native Finnish speakers (31 female, 29 male) producing utterances in normal, intended young and intended old voice modes.

show abstract

Section: Accepted Manuscriptmentioning

confidence: 79%

Section: Accepted Manuscriptmentioning

confidence: 88%

Section: Accepted Manuscriptmentioning

confidence: 99%

mentioning

confidence: 99%

mentioning

confidence: 99%

See 3 more Smart Citations

Acoustical and perceptual study of voice disguise by age modification in speaker verification

Hautamäki

Sahidullah

Hautamäki

et al. 2017

Speech Communication

Self Cite

View full text Add to dashboard Cite

show abstract

On the limits of automatic speaker verification: Explaining degraded recognizer scores through acoustic changes resulting from voice disguise

Hautamäki

Kinnunen

2019

The Journal of the Acoustical Society of America

Self Cite

View full text Add to dashboard Cite

In speaker verification research, objective performance benchmarking of listeners and automatic speaker verification (ASV) systems are of key importance in understanding the limits of speaker recognition. While the adoption of common data and metrics has been instrumental to progress in ASV, there are two major shortcomings. First, the utterances lack intentional voice changes imposed by the speaker. Second, the standard evaluation metrics focus on average performance across all speakers and trials. As a result, a knowledge gap remains in how the acoustic changes impact recognition performance at the level of individual speakers. This paper addresses the limits of speaker recognition in ASV systems under voice disguise using a linear mixed effects model to analyze the impact of change in long-term statistics of selected features (formants F1-F4, the bandwidths B1-B4, F0, and speaking rate) to ASV log-likelihood ratio (LLR) score. The correlations between the proposed predictive model and the LLR scores are 0.72 for females and 0.81 for male speakers. As a whole, the difference in long-term F0 between enrollment and test utterances was found to be the individually most detrimental factor, even if the ASV system uses only spectral, rather than prosodic, features. V

show abstract

Age-Related Voice Disguise and its Impact on Speaker Verification Accuracy

Cited by 2 publications

References 7 publications

Acoustical and perceptual study of voice disguise by age modification in speaker verification

Acoustical and perceptual study of voice disguise by age modification in speaker verification

On the limits of automatic speaker verification: Explaining degraded recognizer scores through acoustic changes resulting from voice disguise

Contact Info

Product

Resources

About