2021 IEEE International Conference on Image Processing (ICIP) 2021
DOI: 10.1109/icip42928.2021.9506588
|View full text |Cite
|
Sign up to set email alerts
|

Rgb-D Fusion For Point-Cloud-Based 3d Human Pose Estimation

Abstract: 3D human pose estimation is an important and challenging task in computer vision. In this paper, we propose a method to estimate 3D human pose from RGB-D images. We adopt a 2D pose estimator to extract color features from the RGB image. The color features are integrated with the depth image in the form of point cloud. To fully exploit geometric information, we design a 3D learning module to extract point-wise features. To take advantage of local information as well as facilitate the convergence of the model, w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(2 citation statements)
references
References 17 publications
0
2
0
Order By: Relevance
“…Mono‐camera approaches with optimization techniques [BKL*16, KPD19] and neural networks [PZDD17,WLLL22,HPY*22] lack depth information and struggle to track global translations. Despite offering an additional depth channel, RGBD‐based solutions [BMB*11, MSS*17, YZ21] are hindered by limited camera resolution and a field of view (FOV), which makes them impractical for product‐level applications.…”
Section: Related Workmentioning
confidence: 99%
“…Mono‐camera approaches with optimization techniques [BKL*16, KPD19] and neural networks [PZDD17,WLLL22,HPY*22] lack depth information and struggle to track global translations. Despite offering an additional depth channel, RGBD‐based solutions [BMB*11, MSS*17, YZ21] are hindered by limited camera resolution and a field of view (FOV), which makes them impractical for product‐level applications.…”
Section: Related Workmentioning
confidence: 99%
“…Figure 2 shows the idea of the proposed method. While we use PointNet [22]-inspired architecture as the main point cloud processing network, we cannot fuse camera and Li-DAR imagery at the lower levels like in other settings [38] because of the sparsity of LiDAR. We propose a cascade architecture with a CNN-based camera network for 2D pose estimation.…”
Section: Introductionmentioning
confidence: 99%