The Accuracy of Speech and Linguistic Analysis in Early Diagnostics of Neurocognitive Disorders in a Memory Clinic Setting

Huurne, Daphne ter; Ramakers, Inez H.G.B.; Possemis, Nina; Banning, Leonie C.P.; Gruters, Angélique A A; Asbroeck, Stephanie Van; König, Alexandra; Linz, Nicklas; Tröger, J.; Langel, Kai; Verhey, Frans R.J.; Vugt, Marjolein de

doi:10.1093/arclin/acac105

Cited by 5 publications

(8 citation statements)

References 39 publications


“…The application used the iPad’s standard internal microphone, which was placed in front of the participant. After speech responses were recorded, they were sent to the backend of ki:elements for preprocessing (such as cutting recordings into relevant parts and audio transformation), automatic speech recognition, and feature extraction [ 31 ]. This resulted in two different measurements of both the total immediate recall and delayed recall: the automatically derived ASR score, and the clinician’s independent score.…”

Section: Methods (mentioning)

confidence: 99%

The Reliability and Clinical Validation of Automatically-Derived Verbal Memory Features of the Verbal Learning Test in Early Diagnostics of Cognitive Impairment

Possemis, ter Huurne, Banning et al. 2024, JAD (self-citation)

Background: Previous research has shown that verbal memory accurately measures cognitive decline in the early phases of neurocognitive impairment. Automatic speech recognition from the verbal learning task (VLT) can potentially be used to differentiate between people with and without cognitive impairment. Objective: Investigate whether automatic speech recognition (ASR) of the VLT is reliable and able to differentiate between subjective cognitive decline (SCD) and mild cognitive impairment (MCI). Methods: The VLT was recorded and processed via a mobile application. Following, verbal memory features were automatically extracted. The diagnostic performance of the automatically derived features was investigated by training machine learning classifiers to distinguish between participants with SCD versus MCI/dementia. Results: The ICC for inter-rater reliability between the clinical and automatically derived features was 0.87 for the total immediate recall and 0.94 for the delayed recall. The full model including the total immediate recall, delayed recall, recognition count, and the novel verbal memory features had an AUC of 0.79 for distinguishing between participants with SCD versus MCI/dementia. The ten best differentiating VLT features correlated low to moderate with other cognitive tests such as logical memory tasks, semantic verbal fluency, and executive functioning. Conclusions: The VLT with automatically derived verbal memory features showed in general high agreement with the clinical scoring and distinguished well between SCD and MCI/dementia participants. This might be of added value in screening for cognitive impairment.
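As an illustration of the agreement statistic reported in this abstract, the following is a minimal sketch of an intraclass correlation between clinician and ASR scores. It assumes the ICC(2,1) form (two-way random effects, absolute agreement, single rater); the abstract does not state which ICC form was used, and the function name `icc_2_1` and the example scores are hypothetical.

```python
def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    `ratings` is a list of rows, one per participant; each row holds one
    score per rater, e.g. [clinician_score, asr_score].
    Assumption: this ICC form is chosen for illustration only.
    """
    n = len(ratings)       # number of participants
    k = len(ratings[0])    # number of raters (here: clinician and ASR)
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares without replication
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_err = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))

    return (ms_rows - ms_err) / (
        ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n
    )


# Hypothetical delayed-recall scores: [clinician, ASR] per participant
scores = [[8, 8], [5, 4], [11, 11], [3, 3], [9, 10], [6, 6]]
print(round(icc_2_1(scores), 2))
```

Because this form measures absolute agreement, a systematic offset between the two scorings (for example, ASR consistently missing one word) lowers the value even when the rankings are identical.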


“…As part of the DeepSpA (Deep Speech Analysis for Cognitive Assessment in Clinical Trials) project, 140 participants were included via the BioBank Alzheimer Centre Limburg (BBACL) study [ 11 ]. Out of the 140 participants, 94 (56 SCD, 38 MCI) completed the semiautomated phone assessment (see Table 1 for participant characteristics).…”

Section: Methods (mentioning)

confidence: 99%

“…At baseline, each participant underwent a face-to-face NPA at the hospital as part of a clinical routine [ 11 ]. After 6 months, participants underwent a semiautomated phone assessment, in which a well-trained test leader guided the participant through the phone assessment.…”

Section: Methods (mentioning)

confidence: 99%

“…The total word count of both the VLT and SVF and the speech and linguistic features were automatically extracted. Examples of speech and linguistic features of the VLT are “primacy count,” “serial clustering,” and “recency count” and for the SVF, examples are “semantic clustering,” “temporal clustering,” and “mean word frequency” [ 8 , 11 , 12 , 24 , 25 ] (for the full list of the speech and linguistic features, see online suppl. Supplement 1; for all online suppl.…”

Section: Methods (mentioning)

confidence: 99%

“…Moreover, ASR may be helpful in differentiating people with subjective cognitive decline (SCD) from MCI/dementia in a clinical face-to-face setting (Possemis et al, unpubl. data) [ 11 , 12 ]. In addition, comparing manual and automatic scoring in a face-to-face setting showed a high level of agreement, demonstrating that ASR can provide accurate results.…”

Section: Introduction (mentioning)

confidence: 99%


Validation of an Automated Speech Analysis of Cognitive Tasks within a Semiautomated Phone Assessment

ter Huurne, Possemis, Banning et al. 2023, Digit Biomark (self-citation)

Introduction: We studied the accuracy of the automatic speech recognition (ASR) software by comparing ASR scores with manual scores from a verbal learning test (VLT) and a semantic verbal fluency (SVF) task in a semiautomated phone assessment in a memory clinic population. Furthermore, we examined the differentiating value of these tests between participants with subjective cognitive decline (SCD) and mild cognitive impairment (MCI). We also investigated whether the automatically calculated speech and linguistic features had an additional value compared to the commonly used total scores in a semiautomated phone assessment. Methods: We included 94 participants from the memory clinic of the Maastricht University Medical Center+ (SCD N = 56 and MCI N = 38). The test leader guided the participant through a semiautomated phone assessment. The VLT and SVF were audio recorded and processed via a mobile application. The recall count and speech and linguistic features were automatically extracted. The diagnostic groups were classified by training machine learning classifiers to differentiate SCD and MCI participants. Results: The intraclass correlation for inter-rater reliability between the manual and the ASR total word count was 0.89 (95% CI 0.09–0.97) for the VLT immediate recall, 0.94 (95% CI 0.68–0.98) for the VLT delayed recall, and 0.93 (95% CI 0.56–0.97) for the SVF. The full model including the total word count and speech and linguistic features had an area under the curve of 0.81 and 0.77 for the VLT immediate and delayed recall, respectively, and 0.61 for the SVF. Conclusion: There was a high agreement between the ASR and manual scores, keeping the broad confidence intervals in mind. The phone-based VLT was able to differentiate between SCD and MCI and can have opportunities for clinical trial screening.


Storyteller in ADNI4: Application of an early Alzheimer's disease screening tool using brief, remote, and speech‐based testing

Skirrow, Meepegama, Weston et al. 2024, Alzheimer's & Dementia

Introduction: Speech‐based testing shows promise for sensitive and scalable objective screening for Alzheimer's disease (AD), but research to date offers limited evidence of generalizability. Methods: Data were taken from the AMYPRED (Amyloid Prediction in Early Stage Alzheimer's Disease from Acoustic and Linguistic Patterns of Speech) studies (N = 101, N = 46 mild cognitive impairment [MCI]) and Alzheimer's Disease Neuroimaging Initiative 4 (ADNI4) remote digital (N = 426, N = 58 self‐reported MCI, mild AD or dementia) and in‐clinic (N = 57, N = 13 MCI) cohorts, in which participants provided audio‐recorded responses to automated remote story recall tasks in the Storyteller test battery. Text similarity, lexical, temporal, and acoustic speech feature sets were extracted. Models predicting early AD were developed in AMYPRED and tested out of sample in the demographically more diverse cohorts in ADNI4 (> 33% from historically underrepresented populations). Results: Speech models generalized well to unseen data in ADNI4 remote and in‐clinic cohorts. The best‐performing models evaluated text‐based metrics (text similarity, lexical features: area under the curve 0.71–0.84 across cohorts). Discussion: Speech‐based predictions of early AD from Storyteller generalize across diverse samples. Highlights: The Storyteller speech‐based test is an objective digital prescreener for Alzheimer's Disease Neuroimaging Initiative 4 (ADNI4). Speech‐based models predictive of Alzheimer's disease (AD) were developed in the AMYPRED (Amyloid Prediction in Early Stage Alzheimer's Disease from Acoustic and Linguistic Patterns of Speech) sample (N = 101). Models were tested out of sample in ADNI4 in‐clinic (N = 57) and remote (N = 426) cohorts. Models showed good generalization out of sample. Models evaluating text matching and lexical features were most predictive of early AD.
