Interspeech 2021
DOI: 10.21437/interspeech.2021-549

EasyCall Corpus: A Dysarthric Speech Dataset

Abstract: This paper introduces a new dysarthric speech command dataset in Italian, called EasyCall corpus. The dataset consists of 21386 audio recordings from 24 healthy and 31 dysarthric speakers, whose individual degree of speech impairment was assessed by neurologists through the Therapy Outcome Measure. The corpus aims at providing a resource for the development of ASR-based assistive technologies for patients with dysarthria. In particular, it may be exploited to develop a voice-controlled contact application for …

Cited by 20 publications (11 citation statements)
References: 7 publications
“…Compared with more widely available normal speech corpora, such as Switchboard and Fisher conversational telephone speech [52] or LibriSpeech [53] containing from hundreds to thousands of hours of audio data, existing dysarthric and elderly speech corpora are much smaller in size. Similar data scarcity is also found not only among dysarthric speech datasets collected for non-English languages such as Dutch [54], Italian [55], Cantonese [56] and Korean [49], but also in elderly speech corpora.…”
Section: Introduction (supporting)
confidence: 66%
“…Similar to the PC-GITA corpus, the EasyCall experiments include a training session with only control speakers and another session with control and dysarthric speakers (no speaker in the test set in the train set). The train and test split was selected based on the split from the EasyCall article [23]. The test set contains 12 dysarthric speakers (7 male, 5 female) ranging from mild to severe dysarthria.…”
Section: Methods (mentioning)
confidence: 99%
“…The EasyCall corpus consists of 24 healthy (10 females, 14 males) and 31 dysarthric (11 females, 20 males) Italian speakers [23]. A range of disorders causing dysarthria includes Parkinson's Disease, Huntington's Disease, Amyotrophic Lateral Sclerosis, peripheral neuropathy, myopathic or myasthenic lesions.…”
Section: EasyCall (mentioning)
confidence: 99%
“…For example, the PRAUTOCAL corpus of speech in Down syndrome (Escudero-Mancebo et al, 2021) contains sentences obtained from speakers with Down syndrome during a video game and qualitatively assessed by several experts. The EasyCall is a dysarthric speech dataset of commands most likely to be used in a voice-controlled contact application (Turrisi et al, 2021). In the Atlanta Motor Speech Disorders Corpus (Laures-Gore et al, 2016) the data include single vowels, single words, sentences and discourse passages from people with motor speech disorders who speak different dialects of English.…”
Section: Oral Speech Corpora In Clinical Linguistics: An Overview (mentioning)
confidence: 99%