2021
DOI: 10.1109/lsp.2021.3050362
|View full text |Cite
|
Sign up to set email alerts
|

Utterance Verification-Based Dysarthric Speech Intelligibility Assessment Using Phonetic Posterior Features

Abstract: In the literature, the task of dysarthric speech intelligibility assessment has been approached through development of different low-level feature representations, subspace modeling, phone confidence estimation or measurement of automatic speech recognition system accuracy. This paper proposes a novel approach where the intelligibility is estimated as the percentage of correct words uttered by a speaker with dysarthria by matching and verifying utterances of the speaker with dysarthria against control speakers… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 13 publications
(8 citation statements)
references
References 24 publications
0
8
0
Order By: Relevance
“…M = Male, and F= Female Dataset Repository Speech Disorders Type Instances Languages References AphasiaBank Aphasia 83 Aphasic 180 with and 140 without English, Spanish, German, Italian, Hungarian, Mandarin, Chinese [ 84 ] PC-GITA 85 - Parkinson’s disease 50 with (25 M, 25 F) and 50 without Spanish [ 26 , 55 , 86–90 ] TORGO 78 TORGO 91 Dysarthria 7 With (4 M, 3 F): 2762 utterances, 5980 from healthy speakers English [ 14 , 55 , 86 , 89 , 92–95 ] Finnish 55 PD Parkinson’s disease 35 with (14 male, 21 fe- male), and 32 without (12 male, 20 female) Finnish [ 55 ] UA Speech 79 UASpeech 96 Dysarthria: cerebral palsy 15 with (4 M, 11 F), and 13 without. 765 isolated words per speaker English [ 26 , 89 , 97–103 ] PDSTU 104 …”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…M = Male, and F= Female Dataset Repository Speech Disorders Type Instances Languages References AphasiaBank Aphasia 83 Aphasic 180 with and 140 without English, Spanish, German, Italian, Hungarian, Mandarin, Chinese [ 84 ] PC-GITA 85 - Parkinson’s disease 50 with (25 M, 25 F) and 50 without Spanish [ 26 , 55 , 86–90 ] TORGO 78 TORGO 91 Dysarthria 7 With (4 M, 3 F): 2762 utterances, 5980 from healthy speakers English [ 14 , 55 , 86 , 89 , 92–95 ] Finnish 55 PD Parkinson’s disease 35 with (14 male, 21 fe- male), and 32 without (12 male, 20 female) Finnish [ 55 ] UA Speech 79 UASpeech 96 Dysarthria: cerebral palsy 15 with (4 M, 11 F), and 13 without. 765 isolated words per speaker English [ 26 , 89 , 97–103 ] PDSTU 104 …”
Section: Discussionmentioning
confidence: 99%
“…17 speakers, 5229 words French [ 26 ] ATR 119 - Dysarthria 17 speakers, 5229 words Japanese [ 96 ] TIMIT 120 TIMIT 121 Dysarthria 630 speakers. 3310 sentences English [ 111 ] CMUArctic Festvox 122 Dysarthria 5 M, and 2 F. 1150 utterances English [ 123 ] EMA 97 - Dysarthria 3 (1 M, 2 F), 680 utterances English [ 124 ] IEMOCAP 125 - Dysarthria 1 M and 1 F. 3900 utterances English: USA [ 124 ] Parkinsons 28 Parkinsons 126 Parkinson’s disease 23 with, and 8 without English [ 127 ] [ 70 ] - Parkinson’s disease 20 with and 20 without. 1040 speech signals Turkish [ 70 ] Spanish datasets 13 <...…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In this regard, it can be claimed that specific defects in articulation at minimal dysarthric disorders lead to the appearance of qualitatively different changes in the spectral characteristics of sounds [14][15][16]. Currently, in Russian speech therapy, the linguistic aspect of sound-pronunciation disorders in children with this type of speech dysontogenesis is practically not developed.…”
Section: Introductionmentioning
confidence: 99%
“…A bidirectional Deep Recurrent Neural Network (biRNN) based DNN-HMM is used for phoneme recognition [15]. In a recent work [19], the authors used a phonetic posterior feature space for matching and verifying the impaired speech with the control speakers data. Several parameters such as Linear Discriminant Analysis (LDA), context dependent states, Feature space Maximum Liklihood Linear Regression (FMLLR) are used with Teacher-Student network [20] to increase the accuracy.…”
Section: Introductionmentioning
confidence: 99%