Social interactions powerfully impact both the brain and the body, but high-resolution descriptions of these important physical interactions are lacking. Currently, most studies of social behavior rely on labor-intensive methods such as manual annotation of individual video frames. These methods are susceptible to experimenter bias and have limited throughput. To understand the neural circuits underlying social behavior, scalable and objective tracking methods are needed. We present a hardware/software system that combines 3D videography, deep learning, physical modeling and GPU-accelerated robust optimization. Our system is capable of fully automatic multi-animal tracking during naturalistic social interactions and allows for simultaneous electrophysiological recordings. We capture the posture dynamics of multiple unmarked mice with high spatial (~2 mm) and temporal (60 frames/s) precision. The method is based on inexpensive consumer cameras and is implemented in Python, making it cheap and straightforward to adopt and customize for studies of neurobiology and animal behavior.
RESULTS
Raw data acquisition
We established an experimental setup that allowed us to capture synchronized color and depth images from multiple angles, while simultaneously recording synchronized neural data (Fig. 1a). We used inexpensive, state-of-the-art 'depth cameras' developed for computer vision and robotics. These cameras contain several imaging modules: one color sensor, two infrared sensors and an infrared laser projector (Fig. 1b). Imaging data pipelines, as well as intrinsic and extrinsic sensor calibration parameters, can be accessed over USB through a C/C++ SDK with Python bindings. We placed four depth cameras, as well as four synchronization LEDs, around a transparent acrylic cylinder that served as our behavioral arena (Fig. 1c).

Each depth camera projects a static dot pattern across the imaged scene, adding texture in the infrared spectrum to reflective surfaces (Fig. 1d). By imaging this highly textured surface simultaneously with the two infrared sensors of each depth camera, it is possible to estimate the distance of each pixel in the infrared image from the depth camera by stereopsis (by locally estimating the binocular disparity between the textured images). Because the dot pattern is static and serves only to add texture, multiple cameras do not interfere with one another, and the same scene can be imaged from multiple angles. This is one key aspect of our method, not possible with depth imaging systems that rely on actively modulated light (such as the Microsoft Kinect system and earlier versions of the Intel RealSense cameras).
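The depth-from-stereopsis step described above can be sketched numerically. The standard pinhole-stereo relation is depth = focal length × baseline / disparity; the sketch below is illustrative only, with made-up focal length and baseline values (the actual calibration parameters are read from the camera SDK and are not reproduced here).

```python
import numpy as np

def disparity_to_depth(disparity_px, focal_px, baseline_m):
    """Convert a binocular disparity map (pixels) to depth (meters).

    Uses the pinhole-stereo relation depth = focal * baseline / disparity.
    Pixels with zero disparity (no stereo match) are assigned depth 0.
    """
    disparity_px = np.asarray(disparity_px, dtype=np.float64)
    depth = np.zeros_like(disparity_px)
    valid = disparity_px > 0
    depth[valid] = focal_px * baseline_m / disparity_px[valid]
    return depth

# Hypothetical values, not from the paper: 640 px focal length and a
# 50 mm baseline between the two infrared sensors of one depth camera.
depth = disparity_to_depth([[16.0, 8.0], [0.0, 32.0]],
                           focal_px=640.0, baseline_m=0.05)
# depth → [[2.0, 4.0], [0.0, 1.0]]  (meters)
```

Note the inverse relationship: large disparities correspond to nearby surfaces, which is why the projected dot pattern matters — it guarantees local texture so the disparity estimate is well-defined even on featureless surfaces.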
Since mouse movement is fast13, it is vital to minimize motion blur in the infrared images and thus in the final 3D data ('point cloud'). To this end, our method relies on two key features. First, we use depth cameras in which the infrared sensors have a global shutter (e.g., Intel D435) rathe...