2016
DOI: 10.1007/978-3-319-43958-7_44
|View full text |Cite
|
Sign up to set email alerts
|

Improving Recognition of Dysarthric Speech Using Severity Based Tempo Adaptation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
4
1

Relationship

4
1

Authors

Journals

citations
Cited by 8 publications
(10 citation statements)
references
References 13 publications
0
10
0
Order By: Relevance
“…In order to modify healthy control speech to emulate dysarthric speech characteristics, we need to first understand the dysarthric speech itself. In our earlier work [23], we modified the tempo of dysarthric speech based on severity to improve the ASR recognition. It was observed that the sonorant regions of dysarthric speech are of longer durations as compared to that of healthy speech.…”
Section: Methodology 21 Phoneme Duration Analysismentioning
confidence: 99%
“…In order to modify healthy control speech to emulate dysarthric speech characteristics, we need to first understand the dysarthric speech itself. In our earlier work [23], we modified the tempo of dysarthric speech based on severity to improve the ASR recognition. It was observed that the sonorant regions of dysarthric speech are of longer durations as compared to that of healthy speech.…”
Section: Methodology 21 Phoneme Duration Analysismentioning
confidence: 99%
“…We propose a method to improve the recognition of dysarthric speech using enhanced speech features that have been extracted using a Deep Autoencoder (DAE). Additionally, we extend our earlier work [21], wherein we transform the dysarthric speech in the temporal domain using severity-based tempo adaptation (TA) and use the tempo adapted dysarthric speech prior to feature enhancement using a DAE. We analyse the contribution of the individual techniques towards improvement in speech recognition as well as tempo adaptation and DAE-based feature enhancement in tandem.…”
Section: Introductionmentioning
confidence: 88%
“…Tempo adaptation parameters as shown in Table 1, were empirically determined for different severity levels in the UA speech corpus as described in [21]. We use the Kaldi [25] toolkit-based deep autoencoder for our experiments.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Acoustic space modification carried out through temporal and frequency morphing improved automatic dysarthric speech recognition as well as subjective evaluation in [3]. It can be seen that temporal adaptation based on dysarthria severity level improved the ASR performamce for dysarthric speech recognition at each severity level [4]. A Convolutive Bottleneck Network (CBN) was used for dysarthric speech feature extraction wherein the pooling operations of the CBN resulted in features that were more robust towards the small local fluctuations in dysarthric speech and outperformed the traditional MFCC feature based recognition [5].…”
Section: Introductionmentioning
confidence: 91%