2021
DOI: 10.1007/s10489-021-02367-6
|View full text |Cite
|
Sign up to set email alerts
|

Linear dynamical systems approach for human action recognition with dual-stream deep features

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 17 publications
(8 citation statements)
references
References 40 publications
0
8
0
Order By: Relevance
“…Rest of the methods that include multi-task hierarchical clustering [57], BT-LSTM [58], deep autoencoder [59], two-stream attention LSTM [60], weighted entropy-variance based feature selection [61], dilated CNN+BiLSTM+RB [62], DS-GRU [43], and local-global features + QSVM [63] obtain 89.7%, 85.3%, 96.2%, 96.9%, 94.5%, 89.0%, 97.1%, and 82.6% accuracies, respectively. For the UCF50 dataset, the proposed method dominates the state-of-the-art methods by obtaining the best accuracy of 97.5%, whereas the (LD-BF) + (LD-DF) [64] obtains the second-based accuracy of 96.7%. The local-global features + QSVM [63] achieves the lowest accuracy of 69.4%, whereas the rest of the methods including multi-task hierarchical clustering [57], deep autoencoder [59], ensemble model with sward-based optimization [65], and DS-GRU [43] obtain [57] 2017 89.7 BT-LSTM [58] 2018 85.3 Deep autoencoder [59] 2019 96.2 STDN [56] 2020 98.2 Two-stream attention LSTM [60] 2020 96.9 Weighted entropy-variances based feature selection [61] 2021 94.5 Dilated CNN+BiLSTM+RB [62] 2021 89.0 DS-GRU [43] 2021 97.1 Local-global features + QSVM [63] 2021 82.6 DA-CNN+Bi-GRU (Proposed) 2022 98.0 Finally, for the HMDB51 dataset comprising of challenging action videos, our proposed method achieves the best results by obtaining an accuracy of 79.3%, whereas the runnerup method is evidential deep learning [66] that attains an accuracy of 77.0%.…”
Section: Comparison With State-of-the-art Methodsmentioning
confidence: 94%
“…Rest of the methods that include multi-task hierarchical clustering [57], BT-LSTM [58], deep autoencoder [59], two-stream attention LSTM [60], weighted entropy-variance based feature selection [61], dilated CNN+BiLSTM+RB [62], DS-GRU [43], and local-global features + QSVM [63] obtain 89.7%, 85.3%, 96.2%, 96.9%, 94.5%, 89.0%, 97.1%, and 82.6% accuracies, respectively. For the UCF50 dataset, the proposed method dominates the state-of-the-art methods by obtaining the best accuracy of 97.5%, whereas the (LD-BF) + (LD-DF) [64] obtains the second-based accuracy of 96.7%. The local-global features + QSVM [63] achieves the lowest accuracy of 69.4%, whereas the rest of the methods including multi-task hierarchical clustering [57], deep autoencoder [59], ensemble model with sward-based optimization [65], and DS-GRU [43] obtain [57] 2017 89.7 BT-LSTM [58] 2018 85.3 Deep autoencoder [59] 2019 96.2 STDN [56] 2020 98.2 Two-stream attention LSTM [60] 2020 96.9 Weighted entropy-variances based feature selection [61] 2021 94.5 Dilated CNN+BiLSTM+RB [62] 2021 89.0 DS-GRU [43] 2021 97.1 Local-global features + QSVM [63] 2021 82.6 DA-CNN+Bi-GRU (Proposed) 2022 98.0 Finally, for the HMDB51 dataset comprising of challenging action videos, our proposed method achieves the best results by obtaining an accuracy of 79.3%, whereas the runnerup method is evidential deep learning [66] that attains an accuracy of 77.0%.…”
Section: Comparison With State-of-the-art Methodsmentioning
confidence: 94%
“…For the UCF50 dataset, the proposed Student 3DCNN -TUTL method outperforms all the comparative methods by obtaining the best accuracy of 97.6% followed by the runnerup LD-BF + LD-DF method [65], which achieves an accuracy of 97.5%. The local-global features + QSVM [45] method attains the lowest accuracy of 69.4% amongst all comparative method on the UCF50 dataset.…”
Section: 3mentioning
confidence: 94%
“…Multi-task hierarchical clustering [46] 2016 93.2 Deep autoencoder [48] 2019 96.4 Ensembled swarm-based optimization [64] 2021 92.2 DS-GRU [52] 2021 95.2 Local-global features + QSVM [45] 2021 69.4 LD-BF + LD-DF [65] 2022 97.5 Student 3DCNN -TUTL (Ours) 2023 97.6…”
Section: Model Year Accuracy (%)mentioning
confidence: 99%
“…Laboratory experiments show that based on motion capture data and 3D avatars, you can train with only simulation data A recurrent neural network achieves almost perfect results in classifying human actions on real data [10] . Du & Mukaidani discussed a two-stream structure human action recognition method based on a linear dynamic system, and proposed a dual-stream deep feature extraction framework based on a pre-processed convolutional neural network, and veri ed the effectiveness of the method [11] . Sedmidubsky and Zezula proposed an evaluation procedure for 3D human action recognition to determine the best combination in a very effective way [12] .…”
Section: Literature Reviewmentioning
confidence: 99%