2011
DOI: 10.1007/s10772-011-9104-6
|View full text |Cite
|
Sign up to set email alerts
|

Using speech rhythm knowledge to improve dysarthric speech recognition

Abstract: We introduce a new framework to improve the dysarthric speech recognition by using the rhythm knowledge. This approach builds speaker-dependent (SD) recognizers with respect to the dysarthria severity level of each speaker. This severity level is determined by a hybrid classifier combining class posterior distributions and a hierarchical structure of multilayer perceptrons. To perform this classification, rhythm-based features are used as input parameters since the preliminary evidence from perceptual experime… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
5
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 18 publications
(5 citation statements)
references
References 15 publications
0
5
0
Order By: Relevance
“…Other binary algorithms examples are linear discriminant analysis (LDA), k-nearest neighbour but with lower accuracy (53). As is the case with other behavioural markers, adding sensitivity beyond binary outcomes (e.g., adding levels of intelligibility) can lead to decreases in accuracy (57). Some recent examples of hierarchical machine-learning model (combination of machine and deep learning algorithms) revealed promising results in ataxic groups (54,(58)(59)(60).…”
Section: Protocol Designmentioning
confidence: 99%
“…Other binary algorithms examples are linear discriminant analysis (LDA), k-nearest neighbour but with lower accuracy (53). As is the case with other behavioural markers, adding sensitivity beyond binary outcomes (e.g., adding levels of intelligibility) can lead to decreases in accuracy (57). Some recent examples of hierarchical machine-learning model (combination of machine and deep learning algorithms) revealed promising results in ataxic groups (54,(58)(59)(60).…”
Section: Protocol Designmentioning
confidence: 99%
“…Speech recognition uses a variety of modeling techniques to ensure better performance [15]. Rhythmic knowledgeof dysarthric speeches improves the system's accuracy [16] .…”
Section: Introductionmentioning
confidence: 99%
“…As is the case with other behavioural markers, adding sensitivity beyond binary outcomes (e.g. adding levels of intelligibility) can lead to decreases in accuracy [ 58 ]. Some recent examples of hierarchical machine-learning model (combination of machine and deep learning algorithms) revealed promising results in ataxic groups [ 55 , 59 61 ].…”
Section: Introductionmentioning
confidence: 99%