Purpose:
The human voice is a powerful and evolved social tool, with hundreds of studies showing that nonverbal vocal parameters robustly influence listeners' perceptions of socially meaningful speaker traits, ranging from perceived gender and age to attractiveness and trustworthiness. However, these studies have utilized a wide variety of voice stimuli to measure listeners' voice-based judgments of these traits. Here, in the largest scale study known to date, we test whether listeners judge the same unseen speakers differently depending on the complexity of the neutral speech stimulus, from single vowel sounds to a full paragraph.
Method:
In a playback experiment testing 2,618 listeners, we examine whether commonly studied voice-based judgments of attractiveness, trustworthiness, dominance, likability, femininity/masculinity, and health differ if listeners hear isolated vowels, a series of vowels, single words, single sentences (greeting), counting from 1 to 10, or a full paragraph recited aloud (Rainbow Passage), recorded from the same 208 men and women. Data were collected using a custom-designed interface in which vocalizers and traits were randomly assigned to raters.
Results:
Linear-mixed models show that the type of voice stimulus does indeed consistently affect listeners' judgments. Overall, ratings of attractiveness, trustworthiness, dominance, likability, health, masculinity among men, and femininity among women increase as speech duration increases. At the same time, speaker-level regression analyses show that interindividual differences in perceived speaker traits are largely preserved across voice stimuli, especially among those of a similar duration.
Conclusions:
Socially relevant perceptions of speakers are not wholly changed but rather moderated by the length of their speech. Indeed, the same vocalizer is perceived in a similar way regardless of which neutral statements they speak, with the caveat that longer utterances explain the most shared variance in listeners' judgments and elicit the highest ratings on all traits, possibly by providing additional nonverbal information to listeners.
Supplemental Material:
https://doi.org/10.23641/asha.21158890