2021
DOI: 10.1007/s10462-021-10107-y
|View full text |Cite
|
Sign up to set email alerts
|

A comparative review of graph convolutional networks for human skeleton-based action recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 27 publications
(12 citation statements)
references
References 67 publications
0
12
0
Order By: Relevance
“…The graph convolutional network uses a graph to save and transfer data and an adjacency matrix to save the connections between the nodes. Thereby, graph convolutional network has an advantage over other deep learning-based methods when dealing with graph data tasks such as skeleton-based action recognition [16][17][18]. In the literature [19], Yan et al propose a spatio-temporal graph convolutional network (ST-GCN) named.…”
Section: Related Workmentioning
confidence: 99%
“…The graph convolutional network uses a graph to save and transfer data and an adjacency matrix to save the connections between the nodes. Thereby, graph convolutional network has an advantage over other deep learning-based methods when dealing with graph data tasks such as skeleton-based action recognition [16][17][18]. In the literature [19], Yan et al propose a spatio-temporal graph convolutional network (ST-GCN) named.…”
Section: Related Workmentioning
confidence: 99%
“…Recently, GCNs have emerged as SOTA approaches that effectively exploit the geometric structures of skeleton graphs to recognize human action and interaction [38]. Yan et al [19] introduce ST-GCN as a pioneering backbone for future GCN-based models.…”
Section: A Gcn-based Human Interaction Recognitionmentioning
confidence: 99%
“…On the other hand, skeleton-based methods predict the action class from the body’s skeletal structure and its motion [15, 11]. They thus rely on an additional preprocessing step called pose estimation , in which the body parts, such as joints and bones, are detected and their coordinates extracted from each video frame [6].…”
Section: Introductionmentioning
confidence: 99%
“…Comparatively, skeleton-based methods are less sensitive to these changes [18] and, given a robust pose estimator, are therefore likely to maintain a high action recognition accuracy. Furthermore, extracting the animal’s pose coordinates from high-dimensional video segments drastically reduces the network’s input space dimensionality, its computational complexity, and overall power consumption [15], which can represent game-changing advantages for field researchers with limited computational resources. Finally, identifying the pose offers a pre-computed geometrical quantification of the animal’s body motion and behavioral changes [67, 45].…”
Section: Introductionmentioning
confidence: 99%