2019
DOI: 10.1007/978-3-030-11012-3_35
|View full text |Cite
|
Sign up to set email alerts
|

Deep Multitask Gaze Estimation with a Constrained Landmark-Gaze Model

Abstract: The step preceding the speaker identification process consists in the determination of the authenticity of a speech sample. The focus of this thesis is on the performance of humans in detecting altered samples from replay, speech synthesis (TTS) and voice conversion (VC) systems. A listening test was constructed on the online survey platform LimeSurvey. The participants were asked to assess a series of recordings by first giving a binary evaluation ("authentic" or "altered") and then by specifying their level … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
39
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
2
2

Relationship

1
8

Authors

Journals

citations
Cited by 52 publications
(39 citation statements)
references
References 55 publications
0
39
0
Order By: Relevance
“…Zhang et al (Zhang et al 2015) first proposed a CNNs-based method to estimate gaze, the method was designed based on LeNet (Lecun et al 1998) and estimates gaze from eye images. Yu et al (Yu, Liu, and Odobez 2018) proposed a multitask gaze estimation model with landmark constrain, they estimate gaze from eye images. Fischer et al (Fischer, Chang, and Demiris 2018) extracted feature from two-eye images with VGG-16 (Karen and Andrew 2014) to estimate gaze, they use an ensemble scheme to increase robustness of proposed method.…”
Section: Appearance-based Methodsmentioning
confidence: 99%
“…Zhang et al (Zhang et al 2015) first proposed a CNNs-based method to estimate gaze, the method was designed based on LeNet (Lecun et al 1998) and estimates gaze from eye images. Yu et al (Yu, Liu, and Odobez 2018) proposed a multitask gaze estimation model with landmark constrain, they estimate gaze from eye images. Fischer et al (Fischer, Chang, and Demiris 2018) extracted feature from two-eye images with VGG-16 (Karen and Andrew 2014) to estimate gaze, they use an ensemble scheme to increase robustness of proposed method.…”
Section: Appearance-based Methodsmentioning
confidence: 99%
“…The results of gaze estimation are significantly affected by the head pose. Similar to [32], we normalize the image data to weaken the influence of this factor. The basic concept of data preprocessing is shown in Fig.…”
Section: B Data Preprocessingmentioning
confidence: 99%
“…Fischer et al [31] recorded a new dataset of different head postures to improve the robustness of gaze estimation, applied semantic image inpainting to the area covered by glasses to eliminate the obtrusiveness of the glasses and built a bridge between training and test images. Yu et al [32] introduced a constrained landmark-Gaze model to get the relation of eye landmark locations and gaze directions. Park et al [33] used single eye image as input and simplified the task of 3D gaze estimation.…”
Section: Introductionmentioning
confidence: 99%
“…With recent advances in machine learning, the combination of appearance-based gaze estimation and deep learning has become a popular method for remote estimation of gaze [53,7,54,48,13,55]. Appearance-based algorithms have the advantage of only using an image of the person's face or eyes as input, therefore eliminating the need of any specialized hardware other than a regular camera.…”
Section: Introductionmentioning
confidence: 99%