2017
DOI: 10.1145/3072959.3073596
|View full text |Cite
|
Sign up to set email alerts
|

VNect

Abstract: Fig. 1. We recover the full global 3D skeleton pose in real-time from a single RGB camera, even wireless capture is possible by streaming from a smartphone (left). It enables applications such as controlling a game character, embodied VR, sport motion analysis and reconstruction of community video (right). Community videos (CC BY) courtesy of Real Madrid C.F. [2016] and RUSFENCING-TV [2017].We present the first real-time method to capture the full global 3D skeletal pose of a human in a stable, temporally con… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
118
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 858 publications
(119 citation statements)
references
References 102 publications
(112 reference statements)
1
118
0
Order By: Relevance
“…The results in Table. 10 show that our DepthMap representation achieves very close performance with the original design of Zhou et al [51], while saving about 1/5 network parameter (from the depth-regression sub-network). We also compare with Mehta et al [20], who also use a map representation for 3D coordinates. Instead of directly using the (u, v) coordinate from 2D heatmap (with a weak-perspective camera model), they regress the full (x, y, z) coordinates at the peak heatmap location with a full-perspective camera model.…”
Section: Human Pose Estimationmentioning
confidence: 95%
“…The results in Table. 10 show that our DepthMap representation achieves very close performance with the original design of Zhou et al [51], while saving about 1/5 network parameter (from the depth-regression sub-network). We also compare with Mehta et al [20], who also use a map representation for 3D coordinates. Instead of directly using the (u, v) coordinate from 2D heatmap (with a weak-perspective camera model), they regress the full (x, y, z) coordinates at the peak heatmap location with a full-perspective camera model.…”
Section: Human Pose Estimationmentioning
confidence: 95%
“…Particularly, experimental studies in the field of neuroeconomics and consumer behavior employ increasingly more ecologically valid paradigms (Hui et al, 2009;Rangel et al, 2008;Sanfey et al, 2006;Sharp et al, 2012); the current study suggests that finegrained recording of full-body movements in the laboratory can allow to track the time course of the underlying cognitive processes. Furthermore, the current work provides basis for principally new field studies tracking decision-makers' behavior in the real world: recent progress in computer vision (e.g., Toshev & Szegedy, 2014;Cao et al, 2017;Mehta et al, 2017) enables the researchers to reconstruct walking trajectories during decision making from video stream data, eliminating the need for dedicated motion capture hardware. These trajectories can then be used for reverse inference of the walkers' cognitive processes across domains.…”
Section: Wider Implicationsmentioning
confidence: 99%
“…However, these methods only capture 2D skeletal information. Predicting 3D poses directly from 2D RGB images has been demonstrated using offline methods [Bogo et al 2016;Tekin et al 2016;Zhou et al 2016] and in online settings [Mehta et al 2017]. Monocular depth cameras provide additional information and have been shown to aid robust skeletal tracking [Ganapathi et al 2012;Ma and Wu 2014;Shotton et al 2013;Taylor et al , 2012 and enable dense surface reconstruction even under deformation [Dou et al 2016;Newcombe et al 2015;Zollhöfer et al 2014].…”
Section: Related Workmentioning
confidence: 99%