Interspeech 2017 2017
DOI: 10.21437/interspeech.2017-1508
|View full text |Cite
|
Sign up to set email alerts
|

Mapping Across Feature Spaces in Forensic Voice Comparison: The Contribution of Auditory-Based Voice Quality to (Semi-)Automatic System Testing

Abstract: In forensic voice comparison, there is increasing focus on the integration of automatic and phonetic methods to improve the validity and reliability of voice evidence to the courts. In line with this, we present a comparison of long-term measures of the speech signal to assess the extent to which they capture complementary speaker-specific information. Likelihood ratiobased testing was conducted using MFCCs and (linear and Melweighted) long-term formant distributions (LTFDs). Fusing automatic and semi-automati… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
7
1

Year Published

2018
2018
2024
2024

Publication Types

Select...
3
3

Relationship

2
4

Authors

Journals

citations
Cited by 12 publications
(10 citation statements)
references
References 20 publications
2
7
1
Order By: Relevance
“…nasality, creakiness) increased, decreased or remained the same when comparing high quality recordings and telephone-degraded recordings. We have investigated possible correlations between different long-term vocal tract output measures -including supralaryngeal settings -with the aim of finding how VQ analysis can complement longterm formant distributions (LTFDs) and Mel frequency cepstral coefficients calculated across entire speech samples (MFCCs; French et al 2015, Hughes et al 2017.…”
Section: The Vocal Profile Analysis: Applications Issues and Challengesmentioning
confidence: 99%
“…nasality, creakiness) increased, decreased or remained the same when comparing high quality recordings and telephone-degraded recordings. We have investigated possible correlations between different long-term vocal tract output measures -including supralaryngeal settings -with the aim of finding how VQ analysis can complement longterm formant distributions (LTFDs) and Mel frequency cepstral coefficients calculated across entire speech samples (MFCCs; French et al 2015, Hughes et al 2017.…”
Section: The Vocal Profile Analysis: Applications Issues and Challengesmentioning
confidence: 99%
“…Recordings were drawn from the DyViS corpus of young male standard southern British English speakers [12]. Of the 100 available speakers, 97 were used, based on prior testing outlined in [13]. SASR testing was carried out under four different conditions according to the technical quality of the offender sample.…”
Section: 1! Materialsmentioning
confidence: 99%
“…The suspect recording (Task1) and the four versions of the offender recording (Task2) were prepared for analysis in the same way as described in ¤2.2 of [13]. This involved manual editing of recordings to remove non-speech sounds and overlapping speech, removal of sections containing clipping, voice activity detection to remove silences of greater than 100ms (using the vadsohn function in the VOICEBOX toolkit [15]), and segmentation of the signal into consonants and vowels using stkCV [16].…”
Section: 2! Preparation Of Recordingsmentioning
confidence: 99%
See 2 more Smart Citations