2019 9th IEEE International Conference on Control System, Computing and Engineering (ICCSCE) 2019
DOI: 10.1109/iccsce47578.2019.9068586
|View full text |Cite
|
Sign up to set email alerts
|

Vision-based Human Action Recognition on Pre-trained AlexNet

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(3 citation statements)
references
References 14 publications
0
3
0
Order By: Relevance
“…The CNN is tested using UCF101 and HMDB51 datasets with accuracies of 94.5% and 69.8%, respectively. Mohamed et al [36] also proposed a vision-based human action recognition via transfer learning. Their approach uses AlexNet as a pre-trained CNN to extract low-level features from three different image maps, i.e., motion history image that sustains spatiotemporal data, binary motion energy image that captures the motion region data, and optical flow information that holds accumulative motion speed data.…”
Section: B Fusion Methodsmentioning
confidence: 99%
“…The CNN is tested using UCF101 and HMDB51 datasets with accuracies of 94.5% and 69.8%, respectively. Mohamed et al [36] also proposed a vision-based human action recognition via transfer learning. Their approach uses AlexNet as a pre-trained CNN to extract low-level features from three different image maps, i.e., motion history image that sustains spatiotemporal data, binary motion energy image that captures the motion region data, and optical flow information that holds accumulative motion speed data.…”
Section: B Fusion Methodsmentioning
confidence: 99%
“…In order to maintain the consistency of the feature map size before the feature map fusion, we down-sample the output feature maps of Block1 to Block4. In network training, migration learning can reduce the time of network training [25], so the weight of the vgg16 pre-trained model on ImageNet is used when initializing the convolutional layer in the multi-layer fusion module [26].…”
Section: ) Multi-layer Fusion Modulementioning
confidence: 99%
“…We proposed using AlexNet [3], ResNet18 [4] and SqueezeNet1_0 [5] deep learning models to classify and evaluate the accuracy of dances to its class. Several works have been done to classify human movements such as proposed by Yildirim and Çinar [6], Kumar and Harikiran [7] and Zamri et al [8] that uses deep learning models. However, those works are similar in method whereby the authors utilized singular images of ISSN: 2252-8938  a human performing an action to train their deep learning models.…”
Section: Introductionmentioning
confidence: 99%