Facial expression is retained in deep networks trained for face identification

Colón, Yvette; Castillo, Carlos D.; O’Toole, Alice J.

doi:10.1167/jov.21.4.4

Cited by 14 publications

(11 citation statements)

References 31 publications

“…1C). The results indicated that the expression-selective units spontaneously emerged in the VGG-Face pretrained for face identity recognition, which echoed previous findings (19, 20).…”

Section: Results (supporting)

confidence: 89%

“…It should be noted that, in our study, the classification accuracies of the expression-selective units in the pretrained VGG-Face were worse than the performance of expression recognition in humans, which was consistent with a recent finding showing that the identity-trained DCNN retained expression information but with expression recognition accuracies far below human performance (20). The reason for the decreased expression recognition performance deserves future investigation, although it is beyond the scope of the present study.…”

Section: Discussion (supporting)

confidence: 92%

“…Thus, DCNNs could be a useful model simulating the processes of biological neural systems. More recently, several seminal studies have found that the DCNNs trained to recognize facial expression spontaneously developed facial identity recognition ability, and vice versa, suggesting that integrated representations of identity and expression may arise naturally within neural networks like humans do (19, 20). However, a recent study found that face identity–selective units could spontaneously emerge in an untrained DCNN (21), which seemed to cast substantial doubt on the role of nurture in developing face perception and the abovementioned speculation.…”

Section: Introduction (mentioning)

confidence: 99%

Emerged human-like facial expression representation in a deep convolutional neural network

et al. 2022

Recent studies found that the deep convolutional neural networks (DCNNs) trained to recognize facial identities spontaneously learned features that support facial expression recognition, and vice versa. Here, we showed that the self-emerged expression-selective units in a VGG-Face trained for facial identification were tuned to distinct basic expressions and, importantly, exhibited hallmarks of human expression recognition (i.e., facial expression confusion and categorical perception). We then investigated whether the emergence of expression-selective units is attributed to either face-specific experience or domain-general processing by conducting the same analysis on a VGG-16 trained for object classification and an untrained VGG-Face without any visual experience, both having the identical architecture with the pretrained VGG-Face. Although similar expression-selective units were found in both DCNNs, they did not exhibit reliable human-like characteristics of facial expression perception. Together, these findings revealed the necessity of domain-specific visual experience of face identity for the development of facial expression perception, highlighting the contribution of nurture to form human-like facial expression perception.

“…PCA+LDA models suggest that there is enough physical commonality between images to support recognition of familiar faces across their lifespan, suggesting that multiple different representations are not necessary (Mileva et al, 2020). Likewise, DCNNs store diverse images of the same face (e.g., images that vary in viewpoint, lighting, expression) in the same region of face space (Colón et al, 2021; Hill et al, 2019; see O'Toole et al, 2018 for a summary), suggesting that images from multiple decades might reside in close proximity. Nevertheless, any theory derived from computational models should be complemented by behavioural data from humans.…”

Section: Practitioner Points (mentioning)

confidence: 99%

“…Two lines of evidence show that DCNNs retain a representation of within-identity variability. First, visualization of the top layer of face space shows that images of an identity are clustered based on viewpoint, lighting, expression, and whether the input was a still image or video (Colón et al, 2021; Hill et al, 2019). Second, classification of image attributes (e.g., expression, viewpoint, whether the input was a still image or video) based on output at the top levels of DCNNs is highly accurate (Colón et al; Parde et al, 2017; see also Dhar et al, 2020 for expressivity as a measure of which image attributes are retained).…”

Section: Practitioner Points (mentioning)

confidence: 99%
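
The second line of evidence in the statement above, classifying an image attribute from a network's top-layer output, is commonly tested with a linear readout. The following is only a minimal sketch of that idea, not the method used in any of the cited studies: the embeddings are synthetic Gaussian clusters standing in for real DCNN features, and the function names, dimensions, and class count are all hypothetical.

```python
import numpy as np


def make_synthetic_embeddings(n_per_class=100, n_classes=3, dim=16, seed=0):
    """Synthetic stand-in for top-layer features (assumption: NOT real DCNN output).

    Each attribute class (e.g., an expression label) is a Gaussian cluster
    around its own random center.
    """
    rng = np.random.default_rng(seed)
    centers = rng.normal(scale=3.0, size=(n_classes, dim))
    X = np.concatenate(
        [center + rng.normal(size=(n_per_class, dim)) for center in centers]
    )
    y = np.repeat(np.arange(n_classes), n_per_class)
    return X, y


def fit_linear_probe(X, y, n_classes):
    """Fit a least-squares linear readout onto one-hot class targets."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])  # append a bias column
    Y = np.eye(n_classes)[y]                       # one-hot targets
    W, *_ = np.linalg.lstsq(Xb, Y, rcond=None)
    return W


def predict(W, X):
    """Assign each embedding to the class with the largest readout score."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    return np.argmax(Xb @ W, axis=1)


# Split the synthetic embeddings into train and held-out sets.
X, y = make_synthetic_embeddings()
perm = np.random.default_rng(1).permutation(len(y))
train_idx, test_idx = perm[:200], perm[200:]

W = fit_linear_probe(X[train_idx], y[train_idx], n_classes=3)
accuracy = float(np.mean(predict(W, X[test_idx]) == y[test_idx]))
print(f"held-out accuracy: {accuracy:.2f}")
```

With real data, `X` would hold the top-layer activations for each image and `y` the attribute labels; held-out accuracy above chance would indicate that the attribute is linearly recoverable from the representation.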