A Systematic Review of Online Speech Therapy Systems for Intervention in Childhood Speech Communication Disorders

Attwell, Geertruida Aline; Bennin, Kwabena Ebo; Tekinerdoğan, Bedir

doi:10.3390/s22249713

Cited by 13 publications

(5 citation statements)

References 48 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…It is clear to notice that the majority of proposed SLR focused on only one type of speech disorder, such as 32 and, 33 where only aphasia and Dysarthria are studied, or on a specific type of language, such as Tonal Languages in. 74 Other SLRs 34,35 pay attention to one patient's age. In, 36 the focus is on assistive technologies used as assessment technology for speech disorder patients.…”

Section: Related Workmentioning

confidence: 99%

“… 56 The learning set is used to build the machine learning model, while the testing set is used to evaluate the final model’s performance and generalization. 35 We need to process and turn the user’s speech into a set of features to use ML algorithms.…”

Section: Introductionmentioning

confidence: 99%

“…Most proposed SLRs focused on only one type of speech disorder, such as, 32,33 where only aphasia and Dysarthria are studied, respectively. Other SLRs 34,35 pay attention to one patient's age: children's age. In, 36 the focus is on assistive technologies used.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Exploring the Role of Machine Learning in Diagnosing and Treating Speech Disorders: A Systematic Literature Review

Brahmi,

Mahyoob,

Al-Sarem

et al. 2024

PRBM

View full text Add to dashboard Cite

Purpose Speech disorders profoundly impact the overall quality of life by impeding social operations and hindering effective communication. This study addresses the gap in systematic reviews concerning machine learning-based assistive technology for individuals with speech disorders. The overarching purpose is to offer a comprehensive overview of the field through a Systematic Literature Review (SLR) and provide valuable insights into the landscape of ML-based solutions and related studies. Methods The research employs a systematic approach, utilizing a Systematic Literature Review (SLR) methodology. The study extensively examines the existing literature on machine learning-based assistive technology for speech disorders. Specific attention is given to ML techniques, characteristics of exploited datasets in the training phase, speaker languages, feature extraction techniques, and the features employed by ML algorithms. Originality This study contributes to the existing literature by systematically exploring the machine learning landscape in assistive technology for speech disorders. The originality lies in the focused investigation of ML-speech recognition for impaired speech disorder users over ten years (2014–2023). The emphasis on systematic research questions related to ML techniques, dataset characteristics, languages, feature extraction techniques, and feature sets adds a unique and comprehensive perspective to the current discourse. Findings The systematic literature review identifies significant trends and critical studies published between 2014 and 2023. In the analysis of the 65 papers from prestigious journals, support vector machines and neural networks (CNN, DNN) were the most utilized ML technique (20%, 16.92%), with the most studied disease being Dysarthria (35/65, 54% studies). Furthermore, an upsurge in using neural network-based architectures, mainly CNN and DNN, was observed after 2018. Almost half of the included studies were published between 2021 and 2022).

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Exploring the Role of Machine Learning in Diagnosing and Treating Speech Disorders: A Systematic Literature Review

Brahmi,

Mahyoob,

Al-Sarem

et al. 2024

PRBM

View full text Add to dashboard Cite

show abstract

“…(10) Se concibe como una oportunidad de estimular el lenguaje y la comunicación en general, movilizar los procesos psicológicos, sensoriales, afectivos, interpersonales, cognitivos con el fin de contribuir al desarrollo del aprendizaje y favorecer las habilidades lingüísticas y comunicativas. (22) La atención logopédica posee sus particularidades y sus componentes básicos, sustentados en lo (teóricopráctico) y de las relaciones estructurales y jerárquicas de dicha atención; se enmarcan los siguientes: (23,24,25) • Prevenir los trastornos del lenguaje y la comunicación.…”

Section: Componente Pragmático Del Lenguajeunclassified

A look at phonetics and the pragmatic component of language from a speech therapy point of view

Arzola-Castillo

2023

Salud, Ciencia y Tecnología

View full text Add to dashboard Cite

Nowadays we can find several alterations in the language that gives the guideline to deepen in the subject to approach, for the importance that is conferred to the speech therapy as science, that extends its services to the public health and pedagogy, the speech therapy in the two sectors pursues the purpose of raising the quality of the services, join efforts to achieve an integral citizen in tune with the current demands of the society. Different methods were used from the beginning to the end of the scientific contribution, from the theoretical level: analytical-synthetic, historical-logical, inductive-deductive, from the empirical level: observation, documentary study, speech therapy exploration. These methods made it possible to determine theoretical elements that support the research, evidencing the existence of the problem addressed and its possible ways of solution. For this reason, the scientific problem posed is how to prepare speech therapists on phonetics and the pragmatic component of language from the speech therapy care.

show abstract

“…Automatic speech recognition (ASR) is a technology that enables the conversion of spoken language into written text, making use of machine learning algorithms and acoustic models [ 17 , 18 ]. Over the years, significant advancements in neural networks, such as recurrent neural network (RNN) [ 19 ], bi-directional long short-term memory (BLSTM) [ 20 ], connectionist temporal classification (CTC) [ 21 ], and variants based on the generic networks, have been instrumental in advancing ASR, particularly from the 1990s to the 2010s [ 22 ].…”

Section: Introductionmentioning

confidence: 99%

Improving Text-Independent Forced Alignment to Support Speech-Language Pathologists with Phonetic Transcription

Li,

Wohlan,

Pham

et al. 2023

Sensors

View full text Add to dashboard Cite

Problem: Phonetic transcription is crucial in diagnosing speech sound disorders (SSDs) but is susceptible to transcriber experience and perceptual bias. Current forced alignment (FA) tools, which annotate audio files to determine spoken content and its placement, often require manual transcription, limiting their effectiveness. Method: We introduce a novel, text-independent forced alignment model that autonomously recognises individual phonemes and their boundaries, addressing these limitations. Our approach leverages an advanced, pre-trained wav2vec 2.0 model to segment speech into tokens and recognise them automatically. To accurately identify phoneme boundaries, we utilise an unsupervised segmentation tool, UnsupSeg. Labelling of segments employs nearest-neighbour classification with wav2vec 2.0 labels, before connectionist temporal classification (CTC) collapse, determining class labels based on maximum overlap. Additional post-processing, including overfitting cleaning and voice activity detection, is implemented to enhance segmentation. Results: We benchmarked our model against existing methods using the TIMIT dataset for normal speakers and, for the first time, evaluated its performance on the TORGO dataset containing SSD speakers. Our model demonstrated competitive performance, achieving a harmonic mean score of 76.88% on TIMIT and 70.31% on TORGO. Implications: This research presents a significant advancement in the assessment and diagnosis of SSDs, offering a more objective and less biased approach than traditional methods. Our model’s effectiveness, particularly with SSD speakers, opens new avenues for research and clinical application in speech pathology.

show abstract

A Systematic Review of Online Speech Therapy Systems for Intervention in Childhood Speech Communication Disorders

Cited by 13 publications

References 48 publications

Exploring the Role of Machine Learning in Diagnosing and Treating Speech Disorders: A Systematic Literature Review

Exploring the Role of Machine Learning in Diagnosing and Treating Speech Disorders: A Systematic Literature Review

A look at phonetics and the pragmatic component of language from a speech therapy point of view

Improving Text-Independent Forced Alignment to Support Speech-Language Pathologists with Phonetic Transcription

Contact Info

Product

Resources

About