2023
DOI: 10.1109/tpami.2022.3222784
|View full text |Cite
|
Sign up to set email alerts
|

AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time

Abstract: Accurate whole-body multi-person pose estimation and tracking is an important yet challenging topic in computer vision. To capture the subtle actions of humans for complex behavior analysis, whole-body pose estimation including the face, body, hand and foot is essential over conventional body-only pose estimation. In this paper, we present AlphaPose, a system that can perform accurate whole-body pose estimation and tracking jointly while running in realtime. To this end, we propose several new techniques: Symm… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
91
0
1

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 229 publications
(92 citation statements)
references
References 81 publications
0
91
0
1
Order By: Relevance
“…After image processing, the algorithm integrating YOLOv3 and AlphaPose [ 75 ] was used to detect human poses. The algorithm includes the three main components [ 76 ].…”
Section: Resultsmentioning
confidence: 99%
“…After image processing, the algorithm integrating YOLOv3 and AlphaPose [ 75 ] was used to detect human poses. The algorithm includes the three main components [ 76 ].…”
Section: Resultsmentioning
confidence: 99%
“…One of the first approaches to do so, was a declination of Mask-RCNN, which added a joint regression head to its instance segmentation backbone [ 44 ]. Noteworthy methods from the state-of-the-art such as UniPose [ 20 ], OpenPose [ 45 ] or Alpha-Pose [ 46 ]. In this work we adopt UniPose [ 20 ] as a source for extracting body joints and train our model to estimate action progress.…”
Section: Related Workmentioning
confidence: 99%
“…For MMPose, each image is cropped at the bounding box and this region is passed to a 2D keypoint detector. In this work, the specific architecture we use is an HRNet [21] with a channel width of 48 that is pretrained on the Halpe dataset [22]. We selected this because, in contrast to the commonly used COCO dataset Fig.…”
Section: Data Acquisitionmentioning
confidence: 99%