2020
DOI: 10.1515/lingvan-2019-0058
|View full text |Cite
|
Sign up to set email alerts
|

Comparing the performance of forced aligners used in sociophonetic research

Abstract: AbstractForced aligners have revolutionized sociophonetics, but while there are several forced aligners available, there are few systematic comparisons of their performance. Here, we consider four major forced aligners used in sociophonetics today: MAUS, FAVE, LaBB-CAT and MFA. Through comparisons with human coders, we find that both aligner and phonological context affect the quality of automated alignments of vowels extracted from English sociolinguistic interview data. MFA a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(2 citation statements)
references
References 19 publications
0
2
0
Order By: Relevance
“…In a paper comparing the performance of forced aligners with Australian English, as well as a second human coder, Gonzalez et al (2020) showed that the human coders were most alike and accurate in their performance, at around 80% agreement in this paper compared to between 65and 53% for the ASR systems. They also showed the ASR systems made errors depending on particular phonetic environments, whereas crucially, human coders were not prone to such errors.…”
Section: Introductionmentioning
confidence: 68%
“…In a paper comparing the performance of forced aligners with Australian English, as well as a second human coder, Gonzalez et al (2020) showed that the human coders were most alike and accurate in their performance, at around 80% agreement in this paper compared to between 65and 53% for the ASR systems. They also showed the ASR systems made errors depending on particular phonetic environments, whereas crucially, human coders were not prone to such errors.…”
Section: Introductionmentioning
confidence: 68%
“…With such data, there is a higher risk that completely automated methods could yield spurious measurements. A middle ground is to check a random sample of vowel tokens, particularly those that occur in phonological contexts that disproportionately impact alignment (see Gonzalez, Grama, and Travis 2020). In my view, claims about the behavior of vowel trajectories necessitate more accurate alignments, given these concerns.…”
Section: Extracting Formant Valuesmentioning
confidence: 99%