2014 IEEE Spoken Language Technology Workshop (SLT) 2014
DOI: 10.1109/slt.2014.7078583
|View full text |Cite
|
Sign up to set email alerts
|

Automatic selection of speakers for improved acoustic modelling: recognition of disordered speech with sparse data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
17
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 22 publications
(17 citation statements)
references
References 20 publications
0
17
0
Order By: Relevance
“…While traditional, off-theshelf Automatic Speech Recognition (ASR) systems perform well for normal speech, this is not the case with the atypical dysarthric speech owing to the inter-speaker and intra-speaker inconsistencies in the acoustic space as well as the sparseness of data. Several techniques are employed to improve ASR performance for dysarthric speech: acoustic space enhancement, feature engineering, Deep Neural Networks (DNN), speaker adaptation, lexical model adaptation-individually or as a combination thereof [2,3,4,5,6]. In order to exploit the machine learning techniques for ASR fully, suitable data to build these systems is imperative.…”
Section: Introductionmentioning
confidence: 99%
“…While traditional, off-theshelf Automatic Speech Recognition (ASR) systems perform well for normal speech, this is not the case with the atypical dysarthric speech owing to the inter-speaker and intra-speaker inconsistencies in the acoustic space as well as the sparseness of data. Several techniques are employed to improve ASR performance for dysarthric speech: acoustic space enhancement, feature engineering, Deep Neural Networks (DNN), speaker adaptation, lexical model adaptation-individually or as a combination thereof [2,3,4,5,6]. In order to exploit the machine learning techniques for ASR fully, suitable data to build these systems is imperative.…”
Section: Introductionmentioning
confidence: 99%
“…In [10], authors propose a DNN based interpretable model for objective assessment of dysarthric speech that provides users with an estimate of severity as well as a set of explanatory features. Speaker selection and speaker adaptation techniques have been employed to improve ASR performance for dysarthric speech in [11,12]. ASR configurations have been designed and optimized using dysarthria severity level cues in [13,14,15].…”
Section: Introductionmentioning
confidence: 99%
“…In [2], a similarity measure between dysarthric speakers to select relevant speaker data for training rather than speaker independent acoustic models, followed by maximum a posteriori (MAP) adaptation has been used. In [3] a more suitable prior model for adaptation based on the dysarthric speaker's acoustic characteristics has been used to achieve improved recognition.…”
Section: Introductionmentioning
confidence: 99%