2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) 2023
DOI: 10.1109/asru57964.2023.10389794
|View full text |Cite
|
Sign up to set email alerts
|

Can We Use Speaker Embeddings On Spontaneous Speech Obtained From Medical Conversations To Predict Intelligibility?

Sebastião Quintas,
Mathieu Balaguer,
Julie Mauclair
et al.

Abstract: The automatic prediction of speech intelligibility is a recurrent problem in the context of pathological speech. Despite recent developments, these systems are normally applied to specific speech tasks recorded in clean conditions that do not necessarily reflect a healthcare environment. In the present paper, we intend to test the reliability of an intelligibility predictor on data obtained in clinical conditions, in the specific case of head and neck cancer. In order to do so, we present a system based on spe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 30 publications
0
2
0
Order By: Relevance
“…In the present work, we made use of the x-vector speaker embeddings, widely used in pathological speech assessment (Jeancolas et al, 2021;Kotarba and Kotarba, 2021;Scheuerer et al, 2021). Concerning the automatic prediction of speech intelligibility, these embeddings have outperformed other embedding types such as i-vectors by more than 10% in correlation on a passage reading task (Quintas et al, 2020), and also the more recent ECAPA_TDNN by as much as 13% on a spontaneous and reading speech tasks (Quintas et al, 2023b), making them the state-of-the-art for this particular task. Similarly to previous works, we intend to use them as features for a subsequent signal processing chain.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…In the present work, we made use of the x-vector speaker embeddings, widely used in pathological speech assessment (Jeancolas et al, 2021;Kotarba and Kotarba, 2021;Scheuerer et al, 2021). Concerning the automatic prediction of speech intelligibility, these embeddings have outperformed other embedding types such as i-vectors by more than 10% in correlation on a passage reading task (Quintas et al, 2020), and also the more recent ECAPA_TDNN by as much as 13% on a spontaneous and reading speech tasks (Quintas et al, 2023b), making them the state-of-the-art for this particular task. Similarly to previous works, we intend to use them as features for a subsequent signal processing chain.…”
Section: Methodsmentioning
confidence: 99%
“…These measures are not only clinically relevant, but can also be combined linearly to predict speech intelligibility (Bodt et al, 2002), an aspect that could add an extra layer of interpretability to our system. Furthermore, SAMI could also be adapted to predict the severity of speech disorder and intelligibility measures from a real-life interactions between doctor-patient or even caregiver-patient, an aspect currently underexplored that mainly addresses the assessment of spontaneous speech in clinical contexts (Quintas et al, 2023b).…”
Section: Sami Adaptationsmentioning
confidence: 99%