2024
DOI: 10.1109/tnnls.2022.3201518
|View full text |Cite
|
Sign up to set email alerts
|

Fusing Higher-Order Features in Graph Neural Networks for Skeleton-Based Action Recognition

Abstract: Skeleton sequences are lightweight and compact and thus are ideal candidates for action recognition on edge devices. Recent skeleton-based action recognition methods extract features from 3-D joint coordinates as spatial-temporal cues, using these representations in a graph neural network for feature fusion to boost recognition performance. The use of first-and second-order features, that is, joint and bone representations, has led to high accuracy. Nonetheless, many models are still confused by actions that h… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
18
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 53 publications
(18 citation statements)
references
References 39 publications
0
18
0
Order By: Relevance
“…Caetano et al (2019a) performed feature extraction by calculating the size and direction values of the skeleton to recode the skeleton sequence and map it into a picture. Qin et al (2021) combined spatio-temporal graph convolutional networks to extract higher-order features of the action by defining several special angular encodings between joints. Referring to the above studies, we first associate the possibility based on edge-level, and secondly, the human edge-vector exists in different directions, and the design of higher-order features in different directions considering the motion features of different directions.…”
Section: Feature Definition For Skeleton-based Human Action Recognitionmentioning
confidence: 99%
“…Caetano et al (2019a) performed feature extraction by calculating the size and direction values of the skeleton to recode the skeleton sequence and map it into a picture. Qin et al (2021) combined spatio-temporal graph convolutional networks to extract higher-order features of the action by defining several special angular encodings between joints. Referring to the above studies, we first associate the possibility based on edge-level, and secondly, the human edge-vector exists in different directions, and the design of higher-order features in different directions considering the motion features of different directions.…”
Section: Feature Definition For Skeleton-based Human Action Recognitionmentioning
confidence: 99%
“…However, due to the need to calculate multiple branches at the same time, the calculation cost was high. Literature [2,14] is a model mainly based on 3D convolution. It can be seen that the model mainly focuses on the improvement of accuracy, and the model is relatively complex and the calculation is slow.…”
Section: Comparison With Other Action Recognition Algorithmsmentioning
confidence: 99%
“…Literature [7] and literature [10] have computational costs similar to that of this paper, but the accuracy is 5.3% and 0.6% lower than that of this paper, respectively. Literature [11][12][13][14][15][16] has a higher performance in recognition accuracy, but also has a larger amount of computation and parameters. Therefore, the method proposed in this paper balances the three performance indexes of calculation cost, parameter number and recognition accuracy well, and ensures a high recognition accuracy while reducing parameter number and calculation cost.…”
Section: Comparison With Other Action Recognition Algorithmsmentioning
confidence: 99%
“…In recent years, researchers have become increasingly interested in GCN-based methods [35]- [39] which can reflect the structured relationships between skeletons. Most studies have focused on spatial modeling, which contains pre-defined [7], learnable [8], and dynamic [11] ways.…”
Section: A Skeleton-based Action Recognitionmentioning
confidence: 99%