Background: Progressive supranuclear palsy syndrome (PSPS) and corticobasal syndrome (CBS) as well as non-fluent/agrammatic primary progressive aphasia (naPPA) are often associated with misfolded 4-repeat tau pathology, but the diversity of the associated speech features is poorly understood. Objective: Investigate the full range of acoustic and lexical properties of speech to test the hypothesis that PSPS-CBS show a subset of speech impairments found in naPPA. Methods: Acoustic and lexical measures, extracted from natural, digitized semi-structured speech samples using novel, automated methods, were compared in PSPS-CBS (n = 87), naPPA (n = 25), and healthy controls (HC, n = 41). We related these measures to grammatical performance and speech fluency, core features of naPPA, to neuropsychological measures of naming, executive, memory and visuoconstructional functioning, and to cerebrospinal fluid (CSF) phosphorylated tau (pTau) levels in patients with available biofluid analytes. Results: Both naPPA and PSPS-CBS speech produced shorter speech segments, longer pauses, higher pause rates, reduced fundamental frequency (f0) pitch ranges, and slower speech rate compared to HC. naPPA speech was distinct from PSPS-CBS with shorter speech segments, more frequent pauses, slower speech rate, reduced verb production, and higher partial word production. In both groups, acoustic duration measures generally correlated with speech fluency, measured as words per minute, and grammatical performance. Speech measures did not correlate with standard neuropsychological measures. CSF pTau levels correlated with f0 range in PSPS-CBS and naPPA. Conclusion: Lexical and acoustic speech features of PSPS-CBS overlaps those of naPPA and are related to CSF pTau levels.
The letter-guided naming fluency task is a measure of an individual’s executive function and working memory. This study employed a novel, automated, quantifiable, and reproducible method to investigate how language characteristics of words produced during a fluency task are related to fluency performance, inter-word response time (RT), and over task duration using digitized F-letter-guided fluency recordings produced by 76 young healthy participants. Our automated algorithm counted the number of correct responses from the transcripts of the F-letter fluency data, and individual words were rated for concreteness, ambiguity, frequency, familiarity, and age of acquisition (AoA). Using a forced aligner, the transcripts were automatically aligned with the corresponding audio recordings. We measured inter-word RT, word duration, and word start time from the forced alignments. Articulation rate was also computed. Phonetic and semantic distances between two consecutive F-letter words were measured. We found that total F-letter score was significantly correlated with the mean values of word frequency, familiarity, AoA, word duration, phonetic similarity, and articulation rate; total score was also correlated with an individual’s standard deviation of AoA, familiarity, and phonetic similarity. RT was negatively correlated with frequency and ambiguity of F-letter words and was positively correlated with AoA, number of phonemes, and phonetic and semantic distances. Lastly, the frequency, ambiguity, AoA, number of phonemes, and semantic distance of words produced significantly changed over time during the task. The method employed in this paper demonstrates the successful implementation of our automated language processing pipelines in a standardized neuropsychological task. This novel approach captures subtle and rich language characteristics during test performance that enhance informativeness and cannot be extracted manually without massive effort. This work will serve as the reference for letter-guided category fluency production similarly acquired in neurodegenerative patients.
Purpose This study examines the effect of age on language use with an automated analysis of digitized speech obtained from semistructured, narrative speech samples. Method We examined the Cookie Theft picture descriptions produced by 37 older and 76 young healthy participants. Using modern natural language processing and automatic speech recognition tools, we automatically annotated part-of-speech categories of all tokens, calculated the number of tense-inflected verbs, mean length of clause, and vocabulary diversity, and we rated nouns and verbs for five lexical features: word frequency, familiarity, concreteness, age of acquisition, and semantic ambiguity. We also segmented the speech signals into speech and silence and calculated acoustic features, such as total speech time, mean speech and pause segment durations, and pitch values. Results Older speakers produced significantly more fillers, pronouns, and verbs and fewer conjunctions, determiners, nouns, and prepositions than young participants. Older speakers' nouns and verbs were more familiar, more frequent (verbs only), and less ambiguous compared to those of young speakers. Older speakers produced shorter clauses with a lower vocabulary diversity than young participants. They also produced shorter speech segments and longer pauses with increased total speech time and total number of words. Lastly, we observed an interaction of age and sex in pitch ranges. Conclusions Our results suggest that older speakers' lexical content is less diverse, and these speakers produce shorter clauses than young participants in monologic, narrative speech. Our findings show that lexical and acoustic characteristics of semistructured speech samples can be examined with automated methods.
Background Speech is a complex activity requiring proper function and connectivity of multiple brain networks and as such is sensitive to focal neurodegeneration. We have previously reported on acoustic markers of dysprosody in speech samples of speakers with frontotemporal dementia (FTD) phenotypes. In the current study we explore the longitudinal changes in acoustic‐prosodic markers in FTD. Method We analyzed 102 speech samples of picture descriptions from 46 participants with FTD (Table 1): 8 with non‐fluent/agrammatic primary progressive aphasia (naPPA), 14 with semantic variant PPA (svPPA), 10 with logopenic aphasia (lvPPA) and 14 with behavioral FTD (bvFTD). We automatically segmented the acoustic signal into segments of continuous speech or silence, measured their durations, and derived other measures. We used linear mixed effects (lme) models to test changes over time for each acoustic measure, controlling for sex, education, and random intercepts. We also examined any interaction between phenotypes and disease duration. Result bvFTD speakers increased their pause duration by 0.27 seconds per year and their pause rate by 3.9 pauses per minute (ppm) each year. Their speech segment duration shortened by 0.1 seconds per year (p=0.041), decreasing their total speech time by 6.6 seconds (p=0.003) per year. Thus, bvFTD patients reduced the proportion of speech in their samples by 5.16 percent per year (p=0.008). svPPA speakers increased their pause rate similarly, but in contrast, their pause duration decreased by 0.097 seconds per year and they increased their speech segment frequency by 8.32 per minute each year (p=0.054). naPPA and lvPPA speakers increased their pause rate over time and spent less total time (speech + pause) describing the picture (by 5.6 seconds per year; p=0.018). They did not differ from bvFTD and svPPA in these two acoustic measures. Conclusion In our study all FTD speakers became more dysfluent and produced shorter descriptions with time, however, only bvFTD speakers actually exhibited reduced speech production. In contrast, svPPA speech had more frequent pauses and speech segments over time, rendering it “fragmented” and inefficient. These findings support the role of automated acoustic analysis in characterizing speech longitudinally in neurodegeneration.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.