2016
DOI: 10.1016/j.jcomdis.2016.07.001
|View full text |Cite
|
Sign up to set email alerts
|

Deriving gradient measures of child speech from crowdsourced ratings

Abstract: Recent research has demonstrated that perceptual ratings aggregated across multiple non-expert listeners can reveal gradient degrees of covert contrast between target and error sounds that listeners might transcribe identically. Aggregated ratings have been found to correlate strongly with acoustic gold standard measures both when individual raters use a continuous rating scale such as visual analog scaling (Munson, Johnson, & Edwards, 2012) and when individual raters provide binary ratings (McAllister Byun, H… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
20
1

Year Published

2016
2016
2021
2021

Publication Types

Select...
8

Relationship

1
7

Authors

Journals

citations
Cited by 25 publications
(23 citation statements)
references
References 50 publications
2
20
1
Order By: Relevance
“…They found that binary ratings aggregated over 250 naïve listeners on AMT were highly correlated with binary ratings aggregated over 25 expert listeners ( r = 0.92) and with an acoustic measure of rhoticity, F3-F2 distance ( r = -0.79). Bootstrap analyses revealed that when nine or more AMT listeners were included in a subsample of raters, the aggregated AMT ratings converged with ratings aggregated over subsamples of three expert listeners, considered the “industry standard.” The accuracy of each token can be estimated by aggregating responses across raters using correct , which is the proportion of listeners who in a binary forced-choice task responded that the token was a “correct r sound” (McAllister Byun et al, 2016a). …”
Section: Methodsmentioning
confidence: 99%
“…They found that binary ratings aggregated over 250 naïve listeners on AMT were highly correlated with binary ratings aggregated over 25 expert listeners ( r = 0.92) and with an acoustic measure of rhoticity, F3-F2 distance ( r = -0.79). Bootstrap analyses revealed that when nine or more AMT listeners were included in a subsample of raters, the aggregated AMT ratings converged with ratings aggregated over subsamples of three expert listeners, considered the “industry standard.” The accuracy of each token can be estimated by aggregating responses across raters using correct , which is the proportion of listeners who in a binary forced-choice task responded that the token was a “correct r sound” (McAllister Byun et al, 2016a). …”
Section: Methodsmentioning
confidence: 99%
“…3 All participants had U.S.-based IP addresses and, per self-report, were native speakers of English with no history of speech or hearing impairment. When aggregating accuracy ratings across listeners, we usep correct , defined as the percentage of "correct" ratings out of all ratings, pooled across listeners (McAllister Byun, Harel, Halpin, & Szeredi, 2016).…”
Section: Probe Measurementmentioning
confidence: 99%
“…(Perceptual ratings are not feasible for Phase 1 because within-treatment trials may be prolonged or otherwise unnatural, posing a confound for accuracy ratings provided by untrained listeners.) Following protocols refined in our previous research [42,43], we will split each word probe recording into 50 word-level productions and pool these productions across speakers and time points. These stimuli will then be presented in a randomized order for online listeners who originate from US-based IP addresses and report speaking American English as their native language.…”
Section: Outcomes Measurementmentioning
confidence: 99%
“…Per previous research, ratings will be collected until at least 9 unique listeners have rated each token. The proportion of "correct" ratings out of the total number of ratings (p-correct), which correlates strongly with both expert listeners' ratings and acoustic measures [42,43], will serve as our primary perceptual measure of /ɹ/ production accuracy.…”
Section: Outcomes Measurementmentioning
confidence: 99%