2021
DOI: 10.3389/fpsyg.2020.623237
|View full text |Cite
|
Sign up to set email alerts
|

Towards Computer-Based Automated Screening of Dementia Through Spontaneous Speech

Abstract: Dementia, a prevalent disorder of the brain, has negative effects on individuals and society. This paper concerns using Spontaneous Speech (ADReSS) Challenge of Interspeech 2020 to classify Alzheimer's dementia. We used (1) VGGish, a deep, pretrained, Tensorflow model as an audio feature extractor, and Scikit-learn classifiers to detect signs of dementia in speech. Three classifiers (LinearSVM, Perceptron, 1NN) were 59.1% accurate, which was 3% above the best-performing baseline models trained on the acoustic … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 24 publications
(22 citation statements)
references
References 33 publications
0
22
0
Order By: Relevance
“…Among the 25 studies applying feature-representation transfer, support vector machines and 'vanilla' neural networks were the most common methods used as the final model producing outputs based on the feature representations. Seven studies applied and compared multiple methods for this purpose [18][19][20][21][22][23][24].…”
Section: Resultsmentioning
confidence: 99%
“…Among the 25 studies applying feature-representation transfer, support vector machines and 'vanilla' neural networks were the most common methods used as the final model producing outputs based on the feature representations. Seven studies applied and compared multiple methods for this purpose [18][19][20][21][22][23][24].…”
Section: Resultsmentioning
confidence: 99%
“…Among the 25 studies applying feature-representation transfer, support vector machines and 'vanilla' neural networks were the most common methods used as the final model producing outputs based on the feature representations. Seven studies applied and compared multiple methods for this purpose [18][19][20][21][22][23][24].…”
Section: Resultsmentioning
confidence: 99%
“…Despite the increasing interest and opportunities for relatively easy and cheap data collection, we only found 10 studies that used transfer learning on audio data. However, these studies covered a variety of fields in medicine (and corresponding audio signals): neurology (speech and electromyography [21, 22, 86, 87]), cardiology (heart sound [88, 89]), pulmonology (respiratory sounds [90, 91]), infectious diseases (cough [92]), and otorhinolaryngology (breathing [93]).…”
Section: Discussionmentioning
confidence: 99%
“…Except for the text-based pre-trained models, audio and image-based pre-trained models also have been explored in speechbased AD detection. Chlasta, K. et al [48] trained modified VGGNet architecture to extract acoustic embedding, while Gauder, L. et al [49] trained wav2vec 2.0 framework to extract acoustic embedding vector, of which both added modified CNN modules for classification, reaching 62.5% and 78.9% accuracy, respectively.…”
Section: Comparisons Of Methods For the Adress Challengementioning
confidence: 99%