Interspeech 2020 2020
DOI: 10.21437/interspeech.2020-2698
|View full text |Cite
|
Sign up to set email alerts
|

Improving Detection of Alzheimer’s Disease Using Automatic Speech Recognition to Identify High-Quality Segments for More Robust Feature Extraction

Abstract: Speech and language based automatic dementia detection is of interest due to it being non-invasive, low-cost and potentially able to aid diagnosis accuracy. The collected data are mostly audio recordings of spoken language and these can be used directly for acoustic-based analysis. To extract linguistic-based information, an automatic speech recognition (ASR) system is used to generate transcriptions. However, the extraction of reliable acoustic features is difficult when the acoustic quality of the data is po… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 16 publications
(12 citation statements)
references
References 27 publications
0
11
0
Order By: Relevance
“…(2). Using mul-tiple ASR hypotheses and confidence scores as an input to the BERT system improves the performance compared to using just the best hypothesis as in previously proposed approached [12]. (3).…”
Section: Introductionmentioning
confidence: 90%
See 3 more Smart Citations
“…(2). Using mul-tiple ASR hypotheses and confidence scores as an input to the BERT system improves the performance compared to using just the best hypothesis as in previously proposed approached [12]. (3).…”
Section: Introductionmentioning
confidence: 90%
“…Disfluencies and unclear pronunciation can decrease the performance of ASR systems, whilst at the same time be beneficial if the related information is used to inform the classification [5,12]. In [5], the pause and disfluency annotation was used for punctuation generation providing important linguistic information in addition to the manual transcripts.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…(3) Filled pauses -Next to the number and total duration of silent pauses, we derived frequency counts per sentence for two filled pause type, uh and um, that had been shown to discriminate between AD patients and controls in previous studies [9]. (4) Pronunciation -As the known symptoms of AD patients include mispronunciation [14], we calculated average word level confidence scores as a proxy of pronunciation quality, which have been employed for the speech pattern detection in the context of detection of Alzheimer's Disease [15]. All measures were calculated at utterance level.…”
Section: (Dis)fluencymentioning
confidence: 99%