We compared three-dimensional structure-from-motion (3D-SFM) processing in awake monkeys and humans using functional magnetic resonance imaging. Occipital and midlevel extrastriate visual areas showed similar activation by 3D-SFM stimuli in both species. In contrast, intraparietal areas showed significant 3D-SFM activation in humans but not in monkeys. This suggests that human intraparietal cortex contains visuospatial processing areas that are not present in monkeys.To reconstruct the third dimension from a two-dimensional (2D) retinal image, our brain uses binocular as well as monocular cues such as shading, texture, and occlusion. Both humans and monkeys are also able to extract the 3D structure of an object from motion parallax cues that activate occipitoparietal areas in both species (1-3). Because neurons in the middle temporal area (MT/V5) are sensitive for speed gradients that reflect planes tilted in depth (4, see also 5), this area might play a crucial role in the extraction of depth from motion. Supporting evidence has been gleaned from a functional magnetic resonance imaging (fMRI) study showing 3D-SFM sensitivity in the human MT/V5 complex (hMT/V5ϩ) (6 ). These human fMRI results raise a first question: To what extent can they be generalized to the primate visual system? Furthermore, anatomical evidence suggests that there might be functional differences between human and monkey intraparietal cortex: The intraparietal sulcus separates area 5 from area 7 in the monkey, whereas in humans these two areas belong to the superior parietal lobe. In addition, in humans, the intraparietal lobe separates area 7 from areas 39 and 40, which have no clear counterpart in monkeys (7 ). Therefore, our second goal was to determine whether monkey intraparietal cortex is as important for motion-dependent depth processing as implied by human imaging (6 ).To address these issues, we turned to recently developed fMRI techniques (8) in awake (9-12) rather than anesthetized (13-14 ) monkeys. By performing human fMRI with virtually identical experimental procedures as in the awake monkey fMRI experiments, reliable interspecies comparisons could be made.The stimuli were displays of nine randomly connected lines, rotating in depth, that created a clear 3D percept (movie S1). Control stimuli consisted of the same displays that were either static or moving in one plane. We controlled for potential attentional differences between the 3D and 2D conditions by using a 1-back task in humans, as well as a demanding high-acuity fixation task (8, 9) in both species.In line with earlier reports (4-6, 13), area MT/V5 was activated more by 3D than by 2D moving random-line displays. In addition, the area in the fundus of the superior temporal sulcus (FST) also exhibited significant 3D-SFM sensitivity (Fig. 1A). Figure 2A shows the (3D -2D) pattern of activation for a single human subject, and the average activation for a group of eight subjects is shown in Fig. 2B. These results are similar to those obtained in an earlier study in which so...
The research described in the present article was designed to identify the minimal conditions for the visual perception of 3-dimensional structure from motion by comparing the theoretical limitations of ideal observers with the perceptual performance of actual human subjects on a variety of psychophysical tasks. The research began with a mathematical analysis, which showed that 2-frame apparent motion sequences are theoretically sufficient to distinguish between rigid and nonrigid motion and to identify structural properties of an object that remain invariant under affine transformations, but that 3 or more distinct frames are theoretically necessary to adequately specify properties of euclidean structure such as the relative 3-dimensional lengths or angles between nonparallel line segments. A series of four experiments was then performed to verify the psychological validity of this analysis. The results demonstrated that the determination of structure from motion in actual human observers may be restricted to the use of first order temporal relations, which are available within 2-frame apparent motion sequences. That is to say, the accuracy of observers' judgments did not improve in any of these experiments as the number of distinct frames in an apparent motion sequence was increased from 2 to 8, and performance on tasks involving affine structure was of an order of magnitude greater than performance on similar tasks involving euclidean structure.
A mathematical model of visual flow is presented, which could potentially account for an observer's ability to effectively interact with moving objects. The analysis demonstrates that there is visual information available to an observer about (a) a moving object's angle of approach, (b) changes in its velocity and acceleration, (c) whether its velocity and acceleration are positive or negative, (d) its time to collision with both the horizontal and vertical axes, and (e) whether it will cross the horizontal axis in front of or behind the point of observation. Several experiments are reported in which observers' sensitivity to these different aspects of visual information is examined using a variety of experimental tasks. The results suggest that human observers are highly sensitive to many abstract properties of visual stimulation, but that they are not sensitive to all of the information that is potentially available.
Three-dimensional (3D) shape is important for the visual control of grasping and manipulation and for object recognition. Although there has been some progress in our understanding of how 3D shape is extracted from motion and other monocular cues, little is known of how the human brain extracts 3D shape from disparity, commonly regarded as the strongest depth cue. Previous fMRI studies in the awake monkey have established that the interaction between stereo (present or absent) and the order of disparity (
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.