2022
DOI: 10.1109/tcsvt.2022.3142787
|View full text |Cite
|
Sign up to set email alerts
|

3D Hand Pose Estimation From Monocular RGB With Feature Interaction Module

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 16 publications
(3 citation statements)
references
References 59 publications
0
3
0
Order By: Relevance
“…Our scale attention is proposed to fuse the similarity response map, while most of the previous approaches use attentional mechanisms to improve the representation of features. (2) The implementations are different from previous work [24]. We apply global average pooling to extract the spatial information from the feature map, and then integrate all channels together by convolution.…”
Section: Attentional Mechanismsmentioning
confidence: 99%
See 1 more Smart Citation
“…Our scale attention is proposed to fuse the similarity response map, while most of the previous approaches use attentional mechanisms to improve the representation of features. (2) The implementations are different from previous work [24]. We apply global average pooling to extract the spatial information from the feature map, and then integrate all channels together by convolution.…”
Section: Attentional Mechanismsmentioning
confidence: 99%
“…Object tracking is a basic challenge that involves predicting the target state in the video based on the initial state. It has several uses, including visual surveillance [1], pose estimation [2], and autonomous vehicles [3]. Therefore, it is a very active research direction.…”
Section: Introductionmentioning
confidence: 99%
“…This type of method can be further divided into detection-based and regression-based methods. The detection-based method is to generate a heat map to get the final predicted node joints [ 11 , 12 ], while the regression-based method is to directly regress the node position of all hand joints. A deep network was proposed in [ 13 ], which used a segmentation network to locate the hand, and output the scoring map of each joint point by PoseNet.…”
Section: Related Workmentioning
confidence: 99%