HumanEva: Synchronized Video and Motion Capture Dataset and Baseline Algorithm for Evaluation of Articulated Human Motion

Sigal, Leonid; Balan, A.O.; Black, Michael J.

doi:10.1007/s11263-009-0273-6

Cited by 1,151 publications

(984 citation statements)

References 78 publications

Supporting

Mentioning

947

Contrasting

Unclassified

Order By: Relevance

“…13 salient points on human body: head center, right shoulder, right elbow, right hand, left shoulder, left elbow, left hand, right hip, right knee, right foot (ankle), left hip, left knee, left foot (ankle) were manually marked for all videos in the corpus. We build upon the pose error metric proposed in [21] and define the following pose evaluation metrics for each vignette in the corpus: (a) Average error per frame as in (5), (b) Average error per marker per frame (D aepmpf ) (average of (5) for number of markers) , (c) Average error for different markers per frame as in (6).…”

Section: Methodsmentioning

confidence: 99%

An Optimization Based Framework for Human Pose Estimation in Monocular Videos

Agarwal

Kumar

Ryde

et al. 2012

Advances in Visual Computing

View full text Add to dashboard Cite

Abstract. Human pose estimation using monocular vision is a challenging problem in computer vision. Past work has focused on developing efficient inference algorithms and probabilistic prior models based on captured kinematic/dynamic measurements. However, such algorithms face challenges in generalization beyond the learned dataset. In this work, we propose a model-based generative approach for estimating the human pose solely from uncalibrated monocular video in unconstrained environments without any prior learning on motion capture/image annotation data. We propose a novel Product of Heading Experts (PoHE) based generalized heading estimation framework by probabilistically-merging heading outputs (probabilistic/ non-probabilistic) from time varying number of estimators to bootstrap a synergistically integrated probabilistic-deterministic sequential optimization framework for robustly estimating human pose. Novel pixel-distance based performance measures are developed to penalize false human detections and ensure identity-maintained human tracking. We tested our framework with varied inputs (silhouette and bounding boxes) to evaluate, compare and benchmark it against ground-truth data (collected using our human annotation tool) for 52 video vignettes in the publicly available DARPA Mind's Eye Year I dataset 1 . Results show robust pose estimates on this challenging dataset of highly diverse activities.

show abstract

Section: Methodsmentioning

confidence: 99%

An Optimization Based Framework for Human Pose Estimation in Monocular Videos

Agarwal

Kumar

Ryde

et al. 2012

Advances in Visual Computing

View full text Add to dashboard Cite

show abstract

“…This is already pretty accurate but the black curves show an even better and smoother tracking with a deviation of up to just 2mm. In the second experiment we took a sequence of the HumanEVA-II benchmark [12]. Here a surface model, calibrated image sequences, and background images are provided.…”

Section: Methodsmentioning

confidence: 99%

“…It depicts the unconstrained results in red and the constrained results in black. Table 1 compares the errors (automatically evaluated [12]). Overall the tracking has been improved remarkably using the additional ground plane constraint.…”

Section: Methodsmentioning

confidence: 99%

“…exp(θ jξ j )X i (12) and we can generate a set of equations forcing the transformed point X i to stay close to X i :…”

Section: Soft-constraints For Penalizing Floor Intersectionsmentioning

confidence: 99%

“…Moreover, it is verified to which extent such constraints improve the tracking performance. In particular, we show results on the recent HumanEVA-II [12] benchmark, which involves a quantitative error analysis.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Staying Well Grounded in Markerless Motion Capture

Rosenhahn

Schmaltz

Brox³

et al. 2008

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Abstract. In order to overcome typical problems in markerless motion capture from video, such as ambiguities, noise, and occlusions, many techniques reduce the high dimensional search space by integration of prior information about the movement pattern or scene. In this work, we present an approach in which geometric prior information about the floor location is integrated in the pose tracking process. We penalize poses in which body parts intersect the ground plane by employing soft constraints in the pose estimation framework. Experiments with rigid objects and the HumanEVA-II benchmark show that tracking is remarkably stabilized.

show abstract