2022
DOI: 10.1016/j.neucom.2022.09.071
|View full text |Cite
|
Sign up to set email alerts
|

Action recognition based on RGB and skeleton data sets: A survey

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 39 publications
(13 citation statements)
references
References 104 publications
0
13
0
Order By: Relevance
“…Yang et al. [5] proposed using DMM (Depth Motion Maps) to describe the behaviour's 3D structure and shape information. In order to make use of the different body shapes and motion information in the depth map, each depth frame was projected onto the three orthogonal Descartes planes.…”
Section: Multimodal Cooperative Self‐attention Networkmentioning
confidence: 99%
See 2 more Smart Citations
“…Yang et al. [5] proposed using DMM (Depth Motion Maps) to describe the behaviour's 3D structure and shape information. In order to make use of the different body shapes and motion information in the depth map, each depth frame was projected onto the three orthogonal Descartes planes.…”
Section: Multimodal Cooperative Self‐attention Networkmentioning
confidence: 99%
“…Yang et al [5] proposed using DMM (Depth Motion Maps) to describe the behaviour's 3D structure and shape information.…”
Section: Depth Self-attention Subnetworkmentioning
confidence: 99%
See 1 more Smart Citation
“…Conventional methods utilize cost-effective modalities, such as RGB-D videos, which provide rich information for recognizing human interaction. Compared to RGB-D data, skeleton data have lower computational costs and are more robust against noises, background changes, and occlusion, making them the optimal choice for many researchers [2], [3]. Numerous pose estimation algorithms have been developed to extract 3D key-point coordinates of human skeletons from videos [4]- [8].…”
Section: Introductionmentioning
confidence: 99%
“…From the view of the intention of the one-pixel-wide lines, skeletons generally are used to represent the shape features and topological structures of the original object [2]. This kind of thinning algorithms usually operates on binary images and is used in the applications of object detection tasks [3,4], action recognition [5][6][7], and remote sensing image analysis [8]. Thinned lines and curve representations obtained from the edges which commonly have a limited number of pixels in the gradient direction are usually used in fingerprint recognition [9], handwritten character recognition [10,11], and medical analysis [12].…”
Section: Introductionmentioning
confidence: 99%