2022
DOI: 10.1109/lsp.2022.3142675
|View full text |Cite
|
Sign up to set email alerts
|

MTT: Multi-Scale Temporal Transformer for Skeleton-Based Action Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
14
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 41 publications
(14 citation statements)
references
References 26 publications
0
14
0
Order By: Relevance
“…TEM selects multiple adjacent joints between frames, which helps to extract the relevant features of multiple adjacent joints connected in human motion. Multi-scale temporal transformer (MTT) 12 takes into account patterns at various time scales and designs multiple branches to extract various timescale features. MTT learns from skeleton sequences and models long-term time.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…TEM selects multiple adjacent joints between frames, which helps to extract the relevant features of multiple adjacent joints connected in human motion. Multi-scale temporal transformer (MTT) 12 takes into account patterns at various time scales and designs multiple branches to extract various timescale features. MTT learns from skeleton sequences and models long-term time.…”
Section: Related Workmentioning
confidence: 99%
“…Compared with the previous convolutional neural networks (CNNs)-based methods [2][3][4][5] and recurrent neural networks (RNNs)-based methods, [6][7][8][9] Graph convolutional networks (GCNs) have good performance on any graph structure, and scholars are increasling using it for skeleton-based action recognition. [10][11][12][13][14][15][16] Yan et al first propose spatial-temporal GCN (ST-GCN) 17 to apply GCN to skeleton-based action recognition. According to the particularity of skeleton data, the relationship between different joints in the same dimension and the connection between joints in different dimensions are critical.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…They propose a hybrid model by integrating autoregressive and non-autoregressive models [8]. Transformer-based models can also be adapted to skeleton-based action recognition tasks [9], [10].…”
Section: Introductionmentioning
confidence: 99%
“…Today, there has been some research using a transformer-based multi-scale method on many applications. Kong et al [22] proposed a multi-scale temporal transformer for skeletonbased action recognition.Xiao et al [23] proposed a multi-scale spatiotemporal transformer to efficiently aggregate contextual information in long-time sequences of video frames. Yuan et al [24] proposed a multi-scale adaptive segmentation network based on Swin Transformer for remote sensing image segmentation.…”
Section: Introductionmentioning
confidence: 99%