2015 IEEE Winter Conference on Applications of Computer Vision 2015
DOI: 10.1109/wacv.2015.114
|View full text |Cite
|
Sign up to set email alerts
|

Pose Estimation of Object Categories in Videos Using Linear Programming

Abstract: In this paper we propose a method to consistently recover the pose of an object from a known class in a video sequence. As individual poses estimated from monocular images are rather noisy, we optimally aggregate pose evidence over all video frames. We construct a graph where nodes are values sampled from the pose posterior distributions computed by a continuous pose estimator in each frame of the sequence. We then find the globally optimum pose path through the graph that best explains the pose evidence for t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2015
2015
2017
2017

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 26 publications
0
3
0
Order By: Relevance
“…In recent years, also object-based alignment over multiple frames has been investigated for scene understanding in urban street scenes. Feni et al [7] estimate the yaw angle of vehicles from annotated 2D detections. Menze et al [14] model moving rigid objects for improving stereo scene flow estimation between two consecutive frames.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In recent years, also object-based alignment over multiple frames has been investigated for scene understanding in urban street scenes. Feni et al [7] estimate the yaw angle of vehicles from annotated 2D detections. Menze et al [14] model moving rigid objects for improving stereo scene flow estimation between two consecutive frames.…”
Section: Related Workmentioning
confidence: 99%
“…shape prior (7) In the following, we will explain the individual terms of the energy function in detail. Figure 4 shows an illustration of all the energy terms.…”
Section: Cost Functionmentioning
confidence: 99%
“…First, we evaluate our method by comparing with [30], as none of the two methods exploit temporal information, i.e., pose estimation is performed separately in each frame. Then, in order to compare with [31], we extend our model with a Linear Programming (LP) formulation to take temporal information into account, similarly to [7]. For this purpose, we exploit the posterior distribution delivered by our method trained on the first 10 sequences of the EPFL dataset.…”
Section: Mean Ae [ • ] Median Ae [ • ]mentioning
confidence: 99%