Spatially invariant computations in stereoscopic vision

Vidal-Naquet, Michel; Gepshtein, Sergei

doi:10.3389/fncom.2012.00047

Cited by 4 publications

(7 citation statements)

References 50 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Since depth is constant within local regions in squarewave gratings (apart from at their edges), the standard cross-correlation model predicts that observers ought to be better at perceiving square-wave gratings than sinewave gratings, where there are no regions of constant depth. This follows directly from the implicit assumption in cross-correlation models that surfaces are locally fronto-parallel; this assumption is met by squarewave gratings, but not by sinewave gratings (see also, Vidal-Naquet and Gepshtein, 2012 ). In a later paper, Allenmark and Read (2011) were able to account for their results by proposing a link between the magnitude of disparity, and the size of the correlation window used to match disparity.…”

Section: Discussionmentioning

confidence: 98%

“…However, although binocular neurons are tuned to broadly similar orientations and spatial frequencies in each eye, there is evidence for differences in the exact orientation tuning in the two monocular receptive fields ( Bridge and Cumming, 2001 ). Greenwald and Knill (2009) have argued that the information provided by a system showing such responsiveness to orientation disparities would provide valuable information about the slant of surfaces in depth (see also Vidal-Naquet and Gepshtein, 2012 , for a more general approach to handling differences in local binocular image structure). Differences in orientation tuning between the two eyes could therefore reduce the influence of orientation similarity matching, and allow for easier measurement of disparity for such slanted surfaces.…”

Section: Discussionmentioning

confidence: 99%

“…Allowing for image transformations, such as scaling or rotation, prior to cross-correlation could reduce the latter constraint, by allowing for the explicit encoding of local variation in depth. Recently, Vidal-Naquet and Gepshtein (2012) have provided a general approach for the inclusion of image transformations in cross-correlation models of disparity measurement.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Mechanisms for similarity matching in disparity measurement

Goutcher

Hibbard

2014

Front. Psychol.

View full text Add to dashboard Cite

Early neural mechanisms for the measurement of binocular disparity appear to operate in a manner consistent with cross-correlation-like processes. Consequently, cross-correlation, or cross-correlation-like procedures have been used in a range of models of disparity measurement. Using such procedures as the basis for disparity measurement creates a preference for correspondence solutions that maximize the similarity between local left and right eye image regions. Here, we examine how observers’ perception of depth in an ambiguous stereogram is affected by manipulations of luminance and orientation-based image similarity. Results show a strong effect of coarse-scale luminance similarity manipulations, but a relatively weak effect of finer-scale manipulations of orientation similarity. This is in contrast to the measurements of depth obtained from a standard cross-correlation model. This model shows strong effects of orientation similarity manipulations and weaker effects of luminance similarity. In order to account for these discrepancies, the standard cross-correlation approach may be modified to include an initial spatial frequency filtering stage. The performance of this adjusted model most closely matches human psychophysical data when spatial frequency filtering favors coarser scales. This is consistent with the operation of disparity measurement processes where spatial frequency and disparity tuning are correlated, or where disparity measurement operates in a coarse-to-fine manner.

show abstract

Section: Discussionmentioning

confidence: 98%

Section: Discussionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Mechanisms for similarity matching in disparity measurement

Goutcher

Hibbard

2014

Front. Psychol.

View full text Add to dashboard Cite

show abstract

“…The article by Vidal-Naquet and Gepshtein (2012) shows that populations of V1 complex cells, but not individual complex cells, can compute information about stereoscopic disparity in a spatially invariant fashion.…”

Section: Strategies Of Learning Invariancementioning

confidence: 99%

“…It is worth noting that, while it is generally thought that object invariance is represented by neurons in the higher levels of the visual pathway, such as the inferotemporal cortex, neurons in the lower levels, such as the primary visual cortex or V1, can also play key roles in implementing various aspects of invariance. The article by Vidal-Naquet and Gepshtein ( 2012 ) shows that populations of V1 complex cells, but not individual complex cells, can compute information about stereoscopic disparity in a spatially invariant fashion.…”

Section: Strategies Of Learning Invariancementioning

confidence: 99%

Invariant Recognition of Visual Objects: Some Emerging Computational Principles

Bart¹,

Hegdé²

2012

Front. Comput. Neurosci.

View full text Add to dashboard Cite

Stereo Slant Discrimination of Planar 3D Surfaces: Standard vs. Planar Cross-Correlation

Oluk

Bonnen

Burge

et al. 2021

Preprint

View full text Add to dashboard Cite

Binocular stereo cues are important for discriminating 3D surface orientation, especially at near distances. We devised a single-interval task where observers discriminated the slant of a densely textured planar test surface relative to a textured planar surround reference surface. Although surfaces were rendered with correct perspective, the stimuli were designed so that the binocular cues dominated performance. Slant discrimination performance was measured as a function of the reference slant and the level of uncorrelated white noise added to the test-plane images in the left and right eye. We compared human performance with an approximate ideal observer (planar cross correlation, PCC) and two sub-ideal observers. The PCC observer uses the image in one eye and back projection to predict the test image in the other eye for all possible slants, tilts, and distances. The estimated slant, tilt, and distance are determined by the prediction that most closely matches the measured image in the other eye. The first sub-ideal observer (local PCC, LPCC) applies planar cross correlation over local neighborhoods and then pools estimates across the test plane. The second sub-optimal observer (standard cross correlation, SCC), uses only positional disparity information. We find that the ideal observer (PCC) and the first sub-ideal observer (LPCC) outperform the second sub-ideal observer (SCC), demonstrating the benefits of structural disparities. We also find that all three model observers can account for human performance, if two free parameters are included: a fixed small level of internal estimation noise, and a fixed overall efficiency scalar on slant discriminability.

show abstract

Spatially invariant computations in stereoscopic vision

Cited by 4 publications

References 50 publications

Mechanisms for similarity matching in disparity measurement

Mechanisms for similarity matching in disparity measurement

Invariant Recognition of Visual Objects: Some Emerging Computational Principles

Stereo Slant Discrimination of Planar 3D Surfaces: Standard vs. Planar Cross-Correlation

Contact Info

Product

Resources

About