Interspeech 2019 2019
DOI: 10.21437/interspeech.2019-2993
|View full text |Cite
|
Sign up to set email alerts
|

Study of the Performance of Automatic Speech Recognition Systems in Speakers with Parkinson’s Disease

Abstract: Parkinson's Disease (PD) affects motor capabilities of patients, who in some cases need to use human-computer assistive technologies to regain independence. The objective of this work is to study in detail the differences in error patterns from stateof-the-art Automatic Speech Recognition (ASR) systems on speech from people with and without PD. Two different speech recognizers (attention-based end-to-end and Deep Neural Network-Hidden Markov Models hybrid systems) were trained on a Spanish language corpus and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
2

Relationship

4
5

Authors

Journals

citations
Cited by 20 publications
(7 citation statements)
references
References 29 publications
1
6
0
Order By: Relevance
“…Moreover, ASR for child speech is proven more challenging than that for adult speech, due to children's shorter vocal tracts, slower and more variable speaking rate and inaccurate articulation [8]. A speech impairment is known to cause many problems for standard ASR systems, e.g., for impairments related to dysarthria [9], stroke survival, oral cancer [10] or cleft lip and palate [11]. Additionally, recent studies demonstrate how voice assistants perpetuate a racial divide by misrecognising the speech of black speakers more often than of white speakers [2,7].…”
Section: Introductionmentioning
confidence: 99%
“…Moreover, ASR for child speech is proven more challenging than that for adult speech, due to children's shorter vocal tracts, slower and more variable speaking rate and inaccurate articulation [8]. A speech impairment is known to cause many problems for standard ASR systems, e.g., for impairments related to dysarthria [9], stroke survival, oral cancer [10] or cleft lip and palate [11]. Additionally, recent studies demonstrate how voice assistants perpetuate a racial divide by misrecognising the speech of black speakers more often than of white speakers [2,7].…”
Section: Introductionmentioning
confidence: 99%
“…Our implementation of the CycleGAN-VC is based on a Py-Torch implementation of CycleGAN-VC by 1 . The CycleGAN-VC is used as the baseline model.…”
Section: Cyclegan-vcmentioning
confidence: 99%
“…Dysarthria can greatly reduce a person's quality of life and independence. Operating home appliances through voice could greatly improve these people's lives; however, dysarthric speech recognition performance is not good enough yet for practical applications, which means that there is a great need for high performance dysarthric speech recognition [1].…”
Section: Introductionmentioning
confidence: 99%
“…In addition, child speech recognition is also proven to be more difficult than adult speech [51]. Another important challenge for current ASR systems is the speech impairment, i.e., ASR systems perform quite bad on speech spoken by people with dysarthria [52], oral cancer [53], and cleft lip and palate [54]. Last but not least, speech variance in accents and regions has also shown effecting the ASR performance a lot [55,56].…”
Section: Related Work On Bias Researchmentioning
confidence: 99%