Validity of rating scale measures of voice quality

Kreiman, Jody; Gerratt, Bruce R.

doi:10.1121/1.424372

Cited by 151 publications

(99 citation statements)

References 27 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…It has also been reported that perceptual rating of vocal quality, such as hoarseness, is particularly difficult and thus less reliable than expected. 17 Future research involving larger numbers of participants exhibiting a wider range of severity of the speech disturbance may achieve different outcomes.…”

Section: Discussionmentioning

confidence: 99%

Treating the speech disorder in Parkinson's disease online

et al. 2006

View full text Add to dashboard Cite

SummaryThe Lee Silverman Voice Treatment (LSVT) has been shown to be highly effective in treating the speech disorder in Parkinson's Disease (PD). However, patient access to this treatment remains limited in Australia, due to availability of speech pathologists, patient mobility and distance issues. We have investigated the feasibility and effectiveness of an Internet-based telerehabilitation application (eREHAB) for the delivery of the LSVT to persons with PD and disordered speech. Ten participants with PD and dysarthria were treated online with the LSVT for a total of 16 sessions. There were significant improvements in sound pressure levels for vowel prolongation, reading and conversational monologue (Po0.01), pitch range (Po0.05) and in perceptual features of pitch and loudness variability, loudness level (Po0.01) and breathiness (Po0.05). A participant satisfaction questionnaire indicated that 70% of participants expressed overall satisfaction with the online treatment. Telerehabilitation was feasible and effective in delivering the LSVT to people with PD.

show abstract

Section: Discussionmentioning

confidence: 99%

Treating the speech disorder in Parkinson's disease online

et al. 2006

View full text Add to dashboard Cite

show abstract

“…Auditory-perceptual methods have fundamental problems. First, the reliability and validity of auditory-perceptual methods is often lower than desirable (e.g., Kent, 1996; Kreiman & Gerratt, 1998), due to a variety of factors. For example, it is difficult to judge one aspect of speech without interference from other aspects (e.g., judging nasality in the presence of varying degrees of hoarseness); certain judgment categories are intrinsically multidimensional, thus requiring each judge to weigh subjectively and individually these dimensions; and there is a paucity of reference standards.…”

mentioning

confidence: 99%

Computational prosodic markers for autism

Santen

Prud’hommeaux

Black

2010

Autism

View full text Add to dashboard Cite

We present results obtained with new instrumental methods for the acoustic analysis of prosody to evaluate prosody production by children with Autism Spectrum Disorder (ASD) and Typical Development (TD). Two tasks elicit focal stress, one in a vocal imitation paradigm, the other in a picture-description paradigm; a third task also uses a vocal imitation paradigm, and requires repeating stress patterns of two-syllable nonsense words. The instrumental methods differentiated significantly between the ASD and TD groups in all but the focal stress imitation task. The methods also showed smaller differences in the two vocal imitation tasks than in the picture-description task, as was predicted. In fact, in the nonsense word stress repetition task, the instrumental methods showed better performance for the ASD group. The methods also revealed that the acoustic features that predict auditory-perceptual judgment are not the same as those that differentiate between groups. Specifically, a key difference between the groups appears to be a difference in the balance between the various prosodic cues, such as pitch, amplitude, and duration, and not necessarily a difference in the strength or clarity with which prosodic contrasts are expressed

show abstract

“…In that study, similarities among voices were not well predicted by traditional rating scales, or indeed by any set of static phonetic or linguistic-style ''features.'' Other studies ͑e.g., Kreiman et al, 1993;Gerratt et al, 1993;Kreiman and Gerratt, 1998͒ suggest that problems with traditional voice assessment protocols may be due to factors in addition to or instead of scale validity. For example, individual listeners are reasonably self-consistent in their judgments of specific aspects of vocal quality, but across listeners more than 60% ͑and as much as 78%͒ of the variance in ratings of voices may be due to factors other than differences among voices in the quality being rated ͑Kreiman and Gerratt, 1998͒.…”

Section: Introductionmentioning

confidence: 99%

Sources of listener disagreement in voice quality assessment

Kreiman¹,

Gerratt²

2000

The Journal of the Acoustical Society of America

Self Cite

128

View full text Add to dashboard Cite

Traditional interval or ordinal rating scale protocols appear to be poorly suited to measuring vocal quality. To investigate why this might be so, listeners were asked to classify pathological voices as having or not having different voice qualities. It was reasoned that this simple task would allow listeners to focus on the kind of quality a voice had, rather than how much of a quality it possessed, and thus might provide evidence for the validity of traditional vocal qualities. In experiment 1, listeners judged whether natural pathological voice samples were or were not primarily breathy and rough. Listener agreement in both tasks was above chance, but listeners agreed poorly that individual voices belonged in particular perceptual classes. To determine whether these results reflect listeners' difficulty agreeing about single perceptual attributes of complex stimuli, listeners in experiment 2 classified natural pathological voices and synthetic stimuli ͑varying in f 0 only͒ as low pitched or not low pitched. If disagreements derive from difficulties dividing an auditory continuum consistently, then patterns of agreement should be similar for both kinds of stimuli. In fact, listener agreement was significantly better for the synthetic stimuli than for the natural voices. Difficulty isolating single perceptual dimensions of complex stimuli thus appears to be one reason why traditional unidimensional rating protocols are unsuited to measuring pathologic voice quality. Listeners did agree that a few aphonic voices were breathy, and that a few voices with prominent vocal fry and/or interharmonics were rough. These few cases of agreement may have occurred because the acoustic characteristics of the voices in question corresponded to the limiting case of the quality being judged. Values of f 0 that generated listener agreement in experiment 2 were more extreme for natural than for synthetic stimuli, consistent with this interpretation.

show abstract

Validity of rating scale measures of voice quality

Cited by 151 publications

References 27 publications

Treating the speech disorder in Parkinson's disease online

Treating the speech disorder in Parkinson's disease online

Computational prosodic markers for autism

Sources of listener disagreement in voice quality assessment

Contact Info

Product

Resources

About