2024
DOI: 10.3389/fnins.2024.1379988
|View full text |Cite
|
Sign up to set email alerts
|

Synthetic faces generated with the facial action coding system or deep neural networks improve speech-in-noise perception, but not as much as real faces

Yingjia Yu,
Anastasia Lado,
Yue Zhang
et al.

Abstract: The prevalence of synthetic talking faces in both commercial and academic environments is increasing as the technology to generate them grows more powerful and available. While it has long been known that seeing the face of the talker improves human perception of speech-in-noise, recent studies have shown that synthetic talking faces generated by deep neural networks (DNNs) are also able to improve human perception of speech-in-noise. However, in previous studies the benefit provided by DNN synthetic faces was… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 38 publications
0
2
0
Order By: Relevance
“…The model assumes that audiovisual disparity is an intrinsic property of different McGurk stimuli. Perceptual studies using advanced synthetic faces should also allow more insight into understanding and manipulating the factors contributing to stimulus disparity ( Thézé et al, 2020 ; Varano et al, 2021 ; Shan et al, 2022 ; Yu et al, 2024 ), as should measurements of the mouth and face movements made by real talkers ( Jiang et al, 2007 ). The NED model fits a sensory noise parameter for each participant, with the finding that sensory noise increases with age.…”
Section: Discussionmentioning
confidence: 99%
“…The model assumes that audiovisual disparity is an intrinsic property of different McGurk stimuli. Perceptual studies using advanced synthetic faces should also allow more insight into understanding and manipulating the factors contributing to stimulus disparity ( Thézé et al, 2020 ; Varano et al, 2021 ; Shan et al, 2022 ; Yu et al, 2024 ), as should measurements of the mouth and face movements made by real talkers ( Jiang et al, 2007 ). The NED model fits a sensory noise parameter for each participant, with the finding that sensory noise increases with age.…”
Section: Discussionmentioning
confidence: 99%
“…The model assumes that audiovisual disparity is an intrinsic property of different McGurk stimuli. Perceptual studies using advanced synthetic faces should also allow more insight into understanding and manipulating the factors contributing to stimulus disparity (Thézé et al, 2020;Varano et al, 2021;Shan et al, 2022;Yu et al, 2024), as should measurements of the mouth and face movements made by real talkers (Jiang et al, 2007). The NED model fits a sensory noise parameter for each participant, with the finding that sensory noise increases with age.…”
Section: Limitationsmentioning
confidence: 99%