2020
DOI: 10.1007/978-3-030-58565-5_33

InterHand2.6M: A Dataset and Baseline for 3D Interacting Hand Pose Estimation from a Single RGB Image

Abstract: Analysis of hand-hand interactions is a crucial step towards better understanding human behavior. However, most researches in 3D hand pose estimation have focused on the isolated single hand case. Therefore, we firstly propose (1) a large-scale dataset, InterHand2.6M, and (2) a baseline network, InterNet, for 3D interacting hand pose estimation from a single RGB image. The proposed InterHand2.6M consists of 2.6M labeled single and interacting hand frames under various poses from multiple subjects. Our InterNet…

Cited by 190 publications (329 citation statements)
References: 42 publications
“…We believe that our dataset will allow stereo-vision research communities to promote their own studies. We also note that our dataset is still smaller in size than the datasets that have been recently introduced for RGB and RGBD sensors [8], [9], [36], [48]. Additionally, these other datasets include various gestures, such as mid-air gestures and object interaction gestures.…”
Section: Discussion and Limitations (mentioning)
confidence: 99%
“…STB contains 18 K pairs of stereo images from real-world scenarios. Although STB is widely used as a benchmark dataset for hand pose estimation [6], [9], [34], [36], [45], it contains hand images captured from a third-person perspective, which does not align with our research goals. In this study, we collected a novel dataset using a stereo sensor from an egocentric view.…”
Section: B Datasets For Hand Pose Estimation (mentioning)
confidence: 99%