2022
DOI: 10.22452/mjcs.vol35no3.3
|View full text |Cite
|
Sign up to set email alerts
|

Wdsae-DNDT Based Speech Fluency Disorder Classification

Abstract: In this paper, Weight Decorrelated Stacked Autoencoder-Deep Neural Decision Trees (WDSAE-DNDT), a novel hybrid model is proposed for automating the assessment of children’s speech fluency disorders by discerning their disfluencies. In fluency disorder classification, it is imperative to know how each feature contributes to the disorder classification rather than the diagnosis itself and so the depth modified DNDT acts as the best discriminator since it is interpretable by its very nature. The WDSAE presents DN… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 28 publications
0
2
0
Order By: Relevance
“…Despite being described as a condition, stuttering can occur in anyone's speech and is frequently triggered by stress or anxiousness. The speech disfluency classification models, which incorporate a unique hybrid deep ensemble [5][6][7] for categorizing varied speech disfluencies in the UCLASS [8] and Fluency Bank datasets [9], achieve an accuracy range of 97 to 98.1 %. FluentNet is a discrete model that comprises a residual CNN [10] that learns frame-level representations of the speech spectrum.…”
Section: Back Groundmentioning
confidence: 99%
“…Despite being described as a condition, stuttering can occur in anyone's speech and is frequently triggered by stress or anxiousness. The speech disfluency classification models, which incorporate a unique hybrid deep ensemble [5][6][7] for categorizing varied speech disfluencies in the UCLASS [8] and Fluency Bank datasets [9], achieve an accuracy range of 97 to 98.1 %. FluentNet is a discrete model that comprises a residual CNN [10] that learns frame-level representations of the speech spectrum.…”
Section: Back Groundmentioning
confidence: 99%
“…Performance metrics of efficient DenseNet model Table15, the proposed efficient DenseNet model performs well in unique feature extraction for accurate classification of the severity levels of DR, and it has enhanced the efficacy of DR screening. Moreover, the computational complexity[24][25][26][27][28][29] has been reduced compared with the baseline models. The metrics such as precision, recall, and F1 score are used to monitor the grading of DR by the efficient DenseNet model as depicted in Table16, along with the trainable parameters in Table17.…”
mentioning
confidence: 99%