2018
DOI: 10.1109/tpami.2017.2771306
|View full text |Cite
|
Sign up to set email alerts
|

Skeleton-Based Action Recognition Using Spatio-Temporal LSTM Network with Trust Gates

Abstract: Abstract-Skeleton-based human action recognition has attracted a lot of research attention during the past few years. Recent works attempted to utilize recurrent neural networks to model the temporal dependencies between the 3D positional configurations of human body joints for better analysis of human activities in the skeletal data. The proposed work extends this idea to spatial domain as well as temporal domain to better analyze the hidden sources of action-related information within the human skeleton sequ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
257
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 441 publications
(257 citation statements)
references
References 93 publications
(99 reference statements)
0
257
0
Order By: Relevance
“…SYSU dataset: For the empirical evaluations, we compare our DACNN algorithm to other baselines including CNN+DPRL [28], ST-LSTM+Trust Gate [13], Dynamic Skeletons [35], LAFF(SKL) [45], SR-TSL [46], VA-LSTM [47], and GCA-LSTM [48], which includes the most recent deep learning applications (CNN, LSTM, etc.) on this dataset.…”
Section: Action Recognition Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…SYSU dataset: For the empirical evaluations, we compare our DACNN algorithm to other baselines including CNN+DPRL [28], ST-LSTM+Trust Gate [13], Dynamic Skeletons [35], LAFF(SKL) [45], SR-TSL [46], VA-LSTM [47], and GCA-LSTM [48], which includes the most recent deep learning applications (CNN, LSTM, etc.) on this dataset.…”
Section: Action Recognition Resultsmentioning
confidence: 99%
“…* Preprint of the publication [1] including extended experiments, as provided by the authors. The final publication is available at https://ieeexplore.ieee.org/ networks (RNN) [8,12,13]. RNN methods can learn the temporal dynamics of the sequential data; nevertheless, they have practical shortcomings in the training of their stacked structures [14,15].Compared to RNN architectures, CNN-based methods provide more effective solutions by extracting local features from their input and finding discriminative patterns in the data [16,10].…”
mentioning
confidence: 99%
“…Before the raw data can be input int the action recognition module, human key-point coordinates must be generated using the pose estimation technique. Using of human key-point coordinates to train the action recognition module will help to reduce the background clutter [37,38]. Also, it will reduce the computational complexity as compared to using the entire image/video to train the module [39,40].…”
Section: Data Pre-processingmentioning
confidence: 99%
“…They proposed an extension of long short term memory (LSTM) model which leverages group motion of several body joints to recognize human activity from skeletal data. A different adaptation of the LSTM model was proposed by Liu et al [14] where spatial interaction among joints was considered in addition to the temporal dynamics. Veeriah et al [15] proposed a LSTM network to capture the salient motion pattern of body joints.…”
Section: Literature Reviewmentioning
confidence: 99%