2021
DOI: 10.1007/s11227-021-03957-4
|View full text |Cite
|
Sign up to set email alerts
|

A transfer learning-based efficient spatiotemporal human action recognition framework for long and overlapping action classes

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 28 publications
(12 citation statements)
references
References 37 publications
0
12
0
Order By: Relevance
“…Deep learning-based solutions for computer vision have made it easier to deliver technical content for educational purposes or clinical use. It has been mentioned in other papers that conventional image data contain many hidden pieces of information and patterns that can be used for human activity recognition (HAR), and HAR can be applied to many areas such as behavioral analysis, intelligent video surveillance, and robot vision [36]. In terms of erroneous classification, existing hand-engineered and machine learning-based solutions have little or no ability to handle overlapping tasks.…”
Section: Discussionmentioning
confidence: 99%
“…Deep learning-based solutions for computer vision have made it easier to deliver technical content for educational purposes or clinical use. It has been mentioned in other papers that conventional image data contain many hidden pieces of information and patterns that can be used for human activity recognition (HAR), and HAR can be applied to many areas such as behavioral analysis, intelligent video surveillance, and robot vision [36]. In terms of erroneous classification, existing hand-engineered and machine learning-based solutions have little or no ability to handle overlapping tasks.…”
Section: Discussionmentioning
confidence: 99%
“…Many researchers now use RNNs and LSTMs to simulate activity changes over time. LRCNs can learn features from variable-length inputs, unlike traditional input methods [25]. Donahue et al [26] use encoder-decoder architecture to extract video representations.…”
Section: Convnets With Rnnsmentioning
confidence: 99%
“…Researchers are struggling to design systems that can apply experience learned from previous tasks to improve performance on a new task. Transfer learning can reduce the architecture learning time and effort, resulting in more robust and useful activity recognition systems [25]. Muhammad et al [23] reveal that transfer learning is a good approach to action recognition.…”
Section: Transfer Learning With Deep Learning As a Feature Extractormentioning
confidence: 99%
“…An approach with a novel temporal-spatial pooling block for action classification, which can learn pool discriminative frames and pixels in a certain clip, has been recently proposed in [ 90 ]. Similarly, in [ 91 ], an efficient spatiotemporal human action recognition framework for long and overlapping action classes has been proposed. Fine-tuned pre-trained CNN models were exploited to learn the spatial relationship at the frame level whereas an optimized Deep Autoencoder [ 92 ] was used to squeeze high-dimensional deep features.…”
Section: Recent Advances In Human Motion Analysismentioning
confidence: 99%