Twenty speakers, diagnosed as male-to-female transsexuals, produced conversational recordings of speech and voice. The samples were submitted to perceptual evaluations and to acoustic analysis by means of a Visi-Pitch, Apple IIe microcomputer system. Transsexuals categorized as having female voices had higher fundamental frequencies (fo), less extensive downward intonations, a higher percentage of upward intonations and downward shifts, and a smaller percentage of level intonations and level shifts than transsexuals categorized as having male voices. The lowest average fo identified as belonging to a female speaker was 155 Hz. Higher (more feminine) ratings on the masculinity-femininity dimension correlated with fo (r = .89), percentage of level shifts (r = -.67), percentage of downward shifts (r = .50), percentage of level intonations (r = -.43), and percentage of upward intonations (r = .40). Findings are discussed in terms of the relative perceptual salience of average fundamental frequency and patterns of intonation for female voice quality.
We hypothesized that acoustic measures would predict dysphonic severity with differential results for pathological voice types. An instructional program based upon synthesized voice signals was developed to facilitate an awareness of prototypical voice types. Eighty phonatory samples representing normal subjects as well as patients with unilateral vocal fold paralysis, vocal nodules, and functional dysphonia were analyzed acoustically on the basis of four measures: average fundamental frequency (F
0
), jitter, shimmer, and harmonic/noise ratio (H/N ratio). Following training, 29 listeners classified 62% of the phonatory samples on the basis of breathy, hoarse, rough, and normal. Dysphonic severity of rough voices was predicted more successfully by H/N ratio (
r
2
=.73) than by shimmer (
r
2
=.43). Dysphonic severity of breathy voices was predicted only by the combined features of less jitter, more shimmer, and lower H/N ratio (
r
2
=.74). No combination of acoustic variables was successful in the prediction of the hoarse voice type.
The relative effectiveness of three acoustic measures (jitter, shimmer, and harmonic/noise ratio) in predicting the dysphonic severity of a diverse clinical population singly and together was investigated. Phonatory samples were recorded from 20 normal subjects and 60 patients representing 3 laryngeal groups (nodules, paralysis, and functional). The phonatory samples were evaluated by 22 listeners using a 7-point equal-appearing interval scale. Shimmer produced a bivariate correlation of 0.54 with dysphonic severity; harmonic/noise ratio correlated −0.32 with dysphonic severity; and jitter produced no significant correlation with severity. The combination of acoustic variables through multiple regression analysis produced a correlation of 0.56, with only shimmer and average F
0
contributing to the correlation. For this particular clinical population, therefore, findings indicated that (a) none of the variables was strongly correlated with dysphonia ratings, and (b) a combination of acoustic predictors was no more successful than a single predictor of dysphonic severity, namely, shimmer.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.