2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00707
|View full text |Cite
|
Sign up to set email alerts
|

PoseFlow: A Deep Motion Representation for Understanding Human Behaviors in Videos

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
27
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 37 publications
(27 citation statements)
references
References 20 publications
0
27
0
Order By: Relevance
“…HMMR learns a temporal representation for human dynamics by incorporating large-scale 2D pseudo-ground truth poses for in-the-wild videos. It uses PoseFlow (Zhang et al 2018)and Alpha-Pose (Fang et al 2017) for multi-person 2D pose estimation and tracking as a pre-processing step. Each person crop is then given as input to the CNN for estimating the pose and shape, as well as the weak-perspective camera parameters.…”
Section: D Human Motion Estimationmentioning
confidence: 99%
“…HMMR learns a temporal representation for human dynamics by incorporating large-scale 2D pseudo-ground truth poses for in-the-wild videos. It uses PoseFlow (Zhang et al 2018)and Alpha-Pose (Fang et al 2017) for multi-person 2D pose estimation and tracking as a pre-processing step. Each person crop is then given as input to the CNN for estimating the pose and shape, as well as the weak-perspective camera parameters.…”
Section: D Human Motion Estimationmentioning
confidence: 99%
“…Video pose estimation has attracted less attention compared with image-based pose estimation mainly because of the limited number of large-scale benchmarks in video domain. Existing research works focus on extracting temporal cues, such as optical flow [1]- [3], [20], to help refine framewise estimation results generated by large networks. Song et al [1] propose a deep spatio-temporal network, namely Thin-Slicing, which aligns joint heatmaps across frames based on dense optical flow computation.…”
Section: B Human Pose Estimation In Videosmentioning
confidence: 99%
“…1(a), LSTM units are employed to transfer temporal knowledge as hidden states. Besides, optical flow is also widely exploited [1]- [3] The associate editor coordinating the review of this manuscript and approving it for publication was Shuhan Shen.…”
Section: Introductionmentioning
confidence: 99%
“…The performance of human action recognition has an important influence on many other tasks like video understanding and video surveillance. Many works have been proposed with different input modalities, including RGB video [1], [2], [9], [10], optical flow [4], [11] and human 2D/3D skeletons [8], [12], [13] (the optical flow and human skeletons can be estimated directly from the RGB video). Comparing to RGB video and optical flow, skeleton data is computationally more efficient and is robust to the variations in clothing and illumination.…”
Section: Introductionmentioning
confidence: 99%