2022
DOI: 10.1016/j.icte.2021.07.004
|View full text |Cite
|
Sign up to set email alerts
|

Dysarthric-speech detection using transfer learning with convolutional neural networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
13
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 19 publications
(13 citation statements)
references
References 5 publications
0
13
0
Order By: Relevance
“…A crucial reason is that the data is difficult to meet the needs of state-of-the-art end-to-end recognition methods. Therefore, some studies have tried solutions such as data augmentation [ 139 , 140 , 149 , 171 ] and transfer learning [ 141 , 145 , 149 ] to solve this problem. The details of the diagnosis systems in these studies, including data sources, voice type, voice feature, classifier, and effect, can be found in Table 4 .…”
Section: Pathological Voice Recognition For Diagnosis and Evaluationmentioning
confidence: 99%
See 1 more Smart Citation
“…A crucial reason is that the data is difficult to meet the needs of state-of-the-art end-to-end recognition methods. Therefore, some studies have tried solutions such as data augmentation [ 139 , 140 , 149 , 171 ] and transfer learning [ 141 , 145 , 149 ] to solve this problem. The details of the diagnosis systems in these studies, including data sources, voice type, voice feature, classifier, and effect, can be found in Table 4 .…”
Section: Pathological Voice Recognition For Diagnosis and Evaluationmentioning
confidence: 99%
“…DL accounts for only a small part of the current pathological voice recognition methods because a large amount of data is unavailable [ 92 , 93 , 101 , 104 , 110 , [119] , [124] , 129 , 145 , 146 ]. However, researchers can continuously explore the potential of DL methods such as Attention-based LSTM [ 45 , 47 ], end-to-end models [ 134 ], and Transformer models [ 187 ] and try advanced recognition algorithms to improve the performance of IST for medical applications.…”
Section: Limitations and Future Directionsmentioning
confidence: 99%
“…Other studies were focused on the detection of Dysarthria [11,12]. Dysarthria is a motor impairment preventing a person from articulating speech correctly.…”
Section: Introductionmentioning
confidence: 99%
“…These studies generally use TORGO dataset [13] to train and validate their models which consists of audios of varying lengths and of different categories: non-words, short words, restricted sentences, and unrestricted sentences. Several DL models were used, such as Mel-Spectrograms as input to a 2D CNN network [11] or a GAN model with spectral features [12].…”
Section: Introductionmentioning
confidence: 99%
“…In 2021, S R Mani et al [15] proposed a transfer learningbased convolutional neural network model (TL-CNN) and converted audio samples to Mel-spectrograms to improve accuracy on the TORGO dataset. When compared to other machine learning models, the proposed TL-CNN achieved improved accuracy.…”
mentioning
confidence: 99%