Attention module-based spatial–temporal graph convolutional networks for skeleton-based action recognition

Kong, Yinghui; Li, Li; Zhang, Ke; Ni, Qiang; Han, Jungong

doi:10.1117/1.jei.28.4.043032

Cited by 22 publications

(11 citation statements)

References 26 publications

“…Presently, deep learning methods for human action recognition are preferred over traditional skeleton-based ones, which tend to focus on extracting hand crafted features [15,39]. The former methods can be categorized into three major sets: methods based on Recurrent Neural Network (RNN) [19], methods based on Convolutional Neural Network (CNN) [7], and methods based on Graph Convolutional Network (GCN) [17].…”

Section: Action Recognition (mentioning)

confidence: 99%

Geometric Deep Neural Network using Rigid and Non-Rigid Transformations for Human Action Recognition

Friji, Drira, Chaieb et al. 2021

2021 IEEE/CVF International Conference on Computer Vision (ICCV)

Deep Learning architectures, albeit successful in most computer vision tasks, were designed for data with an underlying Euclidean structure, which is not usually fulfilled since pre-processed data may lie on a non-linear space. In this paper, we propose a geometry aware deep learning approach using rigid and non rigid transformation optimization for skeleton-based action recognition. Skeleton sequences are first modeled as trajectories on Kendall's shape space and then mapped to the linear tangent space. The resulting structured data are then fed to a deep learning architecture, which includes a layer that optimizes over rigid and non rigid transformations of the 3D skeletons, followed by a CNN-LSTM network. The assessment on two large scale skeleton datasets, namely NTU-RGB+D and NTU-RGB+D 120, has proven that the proposed approach outperforms existing geometric deep learning methods and exceeds recently published approaches with respect to the majority of configurations.

“…ST-GCN was quickly applied to motion recognition and traffic. For example, Kong Y et al [36] built a dynamic skeleton model based on ST-GCN, combined with the attention module; Geng X et al [37] proposed a spatial-temporal multigraph convolutional network based on ST-GCN (ST-MGCN) to forecast the demand for rides. ST-GCN has a good prediction effect on the transition of crimes in a topological space, but it attaches less importance to timing changes on a single node.…”

Section: Related Work (mentioning)

confidence: 99%

Risk Prediction of Theft Crimes in Urban Communities: An Integrated Model of LSTM and ST-GCN

Han et al. 2020

IEEE Access

Urbanization has been speeding up social and economic transformations in urban communities, the smallest social units in a city. However, urbanization brings challenges to urban management and security. Therefore, a system of risk prediction of crimes may be essential to crime prevention and control in urban communities and its system improvement. To tackle crime-related problems in urban communities, this paper proposes a model of daily crime prediction by combining Long Short-Term Memory Network (LSTM) and Spatial-Temporal Graph Convolutional Network (ST-GCN) to automatically and effectively detect the high-risk areas in a city. Topological maps of urban communities carry the dataset in the model, which mainly includes two modules-spatial-temporal features extraction module and temporal feature extraction module-to extract the factors of theft crimes collectively. We have performed the experimental evaluation of the existing crime data from Chicago, America. The results show that the integrated model demonstrates positive performance in predicting the number of crimes within the sliding time range.

“…Presently, deep learning methods for human action recognition are preferred over traditional skeleton-based ones, which tend to focus on extracting hand crafted features [17,39] The former methods can be categorized into three major sets: methods based on Recurrent Neural Network (RNN) [21], methods based on Convolutional Neural Network (CNN) [8], and methods based on Graph Convolutional Network (GCN) [20].…”

Section: Action Recognition (mentioning)

confidence: 99%

KShapeNet: Riemannian network on Kendall shape space for Skeleton based Action Recognition

Friji, Drira, Chaieb et al. 2020

Preprint

Deep Learning architectures, albeit successful in most computer vision tasks, were designed for data with an underlying Euclidean structure, which is not usually fulfilled since pre-processed data may lie on a non-linear space. In this paper, we propose a geometry aware deep learning approach for skeleton-based action recognition. Skeleton sequences are first modeled as trajectories on Kendall's shape space and then mapped to the linear tangent space. The resulting structured data are then fed to a deep learning architecture, which includes a layer that optimizes over rigid and non rigid transformations of the 3D skeletons, followed by a CNN-LSTM network. The assessment on two large scale skeleton datasets, namely NTU-RGB+D and NTU-RGB+D 120, has proven that the proposed approach outperforms existing geometric deep learning methods and is competitive with respect to recently published approaches.
