2022
DOI: 10.3389/fcomp.2022.770210
|View full text |Cite
|
Sign up to set email alerts
|

Characterizing Dysarthria Diversity for Automatic Speech Recognition: A Tutorial From the Clinical Perspective

Abstract: Despite significant advancements in automatic speech recognition (ASR) technology, even the best performing ASR systems are inadequate for speakers with impaired speech. This inadequacy may be, in part, due to the challenges associated with acquiring a sufficiently diverse training sample of disordered speech. Speakers with dysarthria, which refers to a group of divergent speech disorders secondary to neurologic injury, exhibit highly variable speech patterns both within and across individuals. This diversity … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
14
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 16 publications
(14 citation statements)
references
References 44 publications
0
14
0
Order By: Relevance
“…Dysarthria refers to a group of divergent SMDs often secondary to neurologic injury (but not limited to it) and exhibits highly variable speech patterns within and across individuals [ 10 ]. One of the most established clinical taxonomy for SMD corresponds to the Darley, Aronson, and Brown (DAB) model [ 11 ] that foresees 38 atypical speech features rated on a 7-point scale and groups dysarthria types based on speech feature profiles [ 10 ]. The DAB model split SMD into two classes, apraxia and dysarthria, and dysarthria into five clusters, flaccid, spastic, ataxic, hypokinetic, and hyperkinetic.…”
Section: Introductionmentioning
confidence: 99%
See 4 more Smart Citations
“…Dysarthria refers to a group of divergent SMDs often secondary to neurologic injury (but not limited to it) and exhibits highly variable speech patterns within and across individuals [ 10 ]. One of the most established clinical taxonomy for SMD corresponds to the Darley, Aronson, and Brown (DAB) model [ 11 ] that foresees 38 atypical speech features rated on a 7-point scale and groups dysarthria types based on speech feature profiles [ 10 ]. The DAB model split SMD into two classes, apraxia and dysarthria, and dysarthria into five clusters, flaccid, spastic, ataxic, hypokinetic, and hyperkinetic.…”
Section: Introductionmentioning
confidence: 99%
“…For example, there are ASR speaker-independent (SI) systems, trained on large multispeaker datasets, or ASR speaker-dependent (SD) systems, trained by an existing SI model to a target speaker or by a unique target speaker’s speech data [ 10 , 17 ]. Commercially developed SI have low error rates for healthy speakers but appear to perform considerably worse with speech impairments tasks [ 10 , 18 ]. Thus, extensive work has been conducted on SD systems for speech impairments showing stronger performances than SI [ 10 , 19 , 20 ].…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations