2021
DOI: 10.3390/app11198872
|View full text |Cite
|
Sign up to set email alerts
|

Improving Aphasic Speech Recognition by Using Novel Semi-Supervised Learning Methods on AphasiaBank for English and Spanish

Abstract: Automatic speech recognition in patients with aphasia is a challenging task for which studies have been published in a few languages. Reasonably, the systems reported in the literature within this field show significantly lower performance than those focused on transcribing non-pathological clean speech. It is mainly due to the difficulty of recognizing a more unintelligible voice, as well as due to the scarcity of annotated aphasic data. This work is mainly focused on applying novel semi-supervised learning m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 20 publications
(8 citation statements)
references
References 41 publications
0
8
0
Order By: Relevance
“…Aphasic speech is considered in two of the papers. Torre et al [7] addressed the difficulty of transcribing very unintelligible speech and the scarcity of annotated data by proposing a new semi-supervised learning method with encouraging results. On the other hand, Cistola al.…”
Section: Recent Advances In Application Of Speech and Language Techno...mentioning
confidence: 99%
“…Aphasic speech is considered in two of the papers. Torre et al [7] addressed the difficulty of transcribing very unintelligible speech and the scarcity of annotated data by proposing a new semi-supervised learning method with encouraging results. On the other hand, Cistola al.…”
Section: Recent Advances In Application Of Speech and Language Techno...mentioning
confidence: 99%
“…To identify the pathological symptoms shown in a narrative speech of PWA, researchers have focused on linguistic features (e.g., word frequency, Part-of-Speech (Le et al, 2018), word embeddings (Qin et al, 2019a)), and acoustic features (e.g., filler words, pauses, the number of phones per word (Le and Provost, 2016;Qin et al, 2019b)). With the advances in automated speech recognition (ASR) that can make the transcription of aphasic speech into text (Radford et al, 2022;Baevski et al, 2020), there have been end-toend approaches that do not require explicit feature extraction in assessing patients with aphasia (Chatzoudis et al, 2022;Torre et al, 2021). While these works reveal valuable insight into detecting aphasia, little attention has been paid to identifying the types of aphasia, which can be crucial for proper treatment procedures.…”
Section: Related Workmentioning
confidence: 99%
“…Since a large amount of data is needed as a basis for standardization studies, automated analyses and the recognition of language peculiarities, creating an underlying database is correspondingly complex and will take time. Nevertheless, this approach to digital diagnostics is very promising and should be pursued further (e.g., Kohlschein et al, 2018;Torre et al, 2021).…”
Section: Our Perspectivementioning
confidence: 99%