Proceedings of SLPAT 2015: 6th Workshop on Speech and Language Processing for Assistive Technologies 2015
DOI: 10.18653/v1/w15-5112
|View full text |Cite
|
Sign up to set email alerts
|

Model adaptation and adaptive training for the recognition of dysarthric speech

Abstract: Dysarthria is a neurological speech disorder, which exhibits multi-fold disturbances in the speech production system of an individual and can have a detrimental effect on the speech output. In addition to the data sparseness problems, dysarthric speech is characterised by inconsistencies in the acoustic space making it extremely challenging to model. This paper investigates a variety of baseline speaker independent (SI) systems and its suitability for adaptation. The study also explores the usefulness of speak… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
34
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 45 publications
(35 citation statements)
references
References 23 publications
(32 reference statements)
1
34
0
Order By: Relevance
“…Between the development of DM-NSR and Speech Vision, there have been few other attempts to design dysarthric-specific ASR. The first attempt is [12], in which a whole-word speaker adaptive dysarthric ASR was designed and evaluated on UA-Speech speakers with a vocabulary size of 155 words. Based on whether an ASR system is open-set or closed-set speaker, ASR tasks are categorized into three categories.…”
Section: Related Workmentioning
confidence: 99%
“…Between the development of DM-NSR and Speech Vision, there have been few other attempts to design dysarthric-specific ASR. The first attempt is [12], in which a whole-word speaker adaptive dysarthric ASR was designed and evaluated on UA-Speech speakers with a vocabulary size of 155 words. Based on whether an ASR system is open-set or closed-set speaker, ASR tasks are categorized into three categories.…”
Section: Related Workmentioning
confidence: 99%
“…Speaker selection and speaker adaptation techniques have been employed to improve ASR performance for dysarthric speech in [11,12]. ASR configurations have been designed and optimized using dysarthria severity level cues in [13,14,15].…”
Section: Introductionmentioning
confidence: 99%
“…In [6], a set of MFCC features, that best represent dysarthric acoustic features was selected to be used in Artificial Neural Network (ANN)-based ASR. A hybrid adaptation using maximum likelihood linear regression (MLLR) and MAP [7] have been used to improve dysarthric speech recognition. Voice parameters such as jitter and shimmer features along with a multi-taper spectral estimation have been used along with feature space maximum likelihood linear regression (fMLLR) transformation and speaker adaptation to obtain improved dysarthric speech recognition [8].…”
Section: Introductionmentioning
confidence: 99%
“…Traditionally, speech intelligibility has been an indicator of severity of the speech disorder [13]. An understanding of severity has contributed to improved speech recognition of dysarthric speech as seen in [7,14,15].…”
Section: Introductionmentioning
confidence: 99%