Visual shape perception as Bayesian inference of 3D object-centered shape representations.

Erdogan, Goker; Jacobs, Robert A.

doi:10.1037/rev0000086

Cited by 47 publications

(43 citation statements)

References 111 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For decades, these experiments have been taken as support for the view-based theory of object recognition. However, Erdogan & Jacobs (2017) have recently demonstrated that a 3D modelbased theory of object representation would also be expected to generate viewpoint-dependent recognition performance, once sensing noise and observer uncertainty about object structure are taken into account.…”

Section: From 2d To 3dmentioning

confidence: 99%

“…There is also some evidence that 3D structural models may better account for behavioral data than discriminative models, including recent DNN models, even if the latter currently serve as our best predictors of neural response in higher object areas of monkey and human. Erdogan & Jacobs (2017) recently found that a generative, Bayesian structural 3D object model provides better predictions of human judgments of 3D object shape similarity than discriminative models, including DNNs trained on ImageNet.…”

Section: Generative (Structural) Representations Of 3d Shapementioning

confidence: 99%

See 1 more Smart Citation

Shape from Contour: Computation and Representation

Elder

2018

Annu. Rev. Vis. Sci.

View full text Add to dashboard Cite

The human visual system reliably extracts shape information from complex natural scenes in spite of noise and fragmentation caused by clutter and occlusions. A fast, feedforward sweep through ventral stream involving mechanisms tuned for orientation, curvature, and local Gestalt principles produces partial shape representations sufficient for simpler discriminative tasks. More complete shape representations may involve recurrent processes that integrate local and global cues. While feedforward discriminative deep neural network models currently produce the best predictions of object selectivity in higher areas of the object pathway, a generative model may be required to account for all aspects of shape perception. Research suggests that a successful model will account for our acute sensitivity to four key perceptual dimensions of shape: topology, symmetry, composition, and deformation.

show abstract

Section: From 2d To 3dmentioning

confidence: 99%

Section: Generative (Structural) Representations Of 3d Shapementioning

confidence: 99%

Shape from Contour: Computation and Representation

Elder

2018

Annu. Rev. Vis. Sci.

View full text Add to dashboard Cite

show abstract

“…To explain these inferences, early vision scientists proposed that scene analysis proceeds by inverting causal generative models, also known as "analysis-by-synthesis" or "inverse graphics." Approaches to inverse graphics have been considered for decades in computational vision (3,(5)(6)(7)(8), and these models have some behavioral support (9). However, inference in these models has traditionally been based on top-down stochastic search algorithms, such as Markov chain Monte Carlo (MCMC), which are highly iterative and implausibly slow.…”

Section: Introductionmentioning

confidence: 99%

Efficient inverse graphics in biological face processing

et al. 2020

View full text Add to dashboard Cite

Vision not only detects and recognizes objects, but performs rich inferences about the underlying scene structure that causes the patterns of light we see. Inverting generative models, or “analysis-by-synthesis”, presents a possible solution, but its mechanistic implementations have typically been too slow for online perception, and their mapping to neural circuits remains unclear. Here we present a neurally plausible efficient inverse graphics model and test it in the domain of face recognition. The model is based on a deep neural network that learns to invert a three-dimensional face graphics program in a single fast feedforward pass. It explains human behavior qualitatively and quantitatively, including the classic “hollow face” illusion, and it maps directly onto a specialized face-processing circuit in the primate brain. The model fits both behavioral and neural data better than state-of-the-art computer vision models, and suggests an interpretable reverse-engineering account of how the brain transforms images into percepts.

show abstract

“…Skeletal similarity between every object was calculated in 3D, object-centered, space as the mean Euclidean distance between each point on one skeleton and the closest point on the second skeleton following maximal alignment (see Methods). We chose to test a 3D skeletal description because of behavioral 48 and neural 49 evidence for 3D object-centered representations in the visual system, which include a sensitivity to 3D skeletal structures 36,37 .…”

Section: Experiments 1 -Is Perceived Object Similarity Uniquely Predicmentioning

confidence: 99%

Skeletal descriptions of shape provide unique perceptual information for object recognition

Ayzenberg

Lourenco

2019

Preprint

View full text Add to dashboard Cite

With seemingly little effort, humans can both identify an object across large changes in orientation and extend category membership to novel exemplars. Although researchers argue that object shape is crucial in these cases, there are open questions as to how shape is represented for object recognition. Here we tested whether the human visual system incorporates a three-dimensional skeletal descriptor of shape to determine an object's identity. Skeletal models not only provide a compact description of an object's global shape structure, but also provide a quantitative metric by which to compare the visual similarity between shapes. Our results showed that a model of skeletal similarity explained the greatest amount of variance in participants' object dissimilarity judgments when compared with other computational models of visual similarity (Experiment 1). Moreover, parametric changes to an object's skeleton led to proportional changes in perceived similarity, even when controlling for another model of structure (Experiment 2). Importantly, participants preferentially categorized objects by their skeletons across changes to local shape contours and non-accidental properties (Experiment 3). Our findings highlight the importance of skeletal structure in vision, not only as a shape descriptor, but also as a diagnostic cue of object identity.

show abstract

Visual shape perception as Bayesian inference of 3D object-centered shape representations.

Cited by 47 publications

References 111 publications

Shape from Contour: Computation and Representation

Shape from Contour: Computation and Representation

Efficient inverse graphics in biological face processing

Skeletal descriptions of shape provide unique perceptual information for object recognition

Contact Info

Product

Resources

About