2022
DOI: 10.1109/access.2022.3171263
|View full text |Cite
|
Sign up to set email alerts
|

Ensembled Transfer Learning Based Multichannel Attention Networks for Human Activity Recognition in Still Images

Abstract: Human activity recognition is one of the most difficult tasks in computer vision. Due to the lack of time information, detecting human activities from still photos is more difficult than sensor-based or videobased techniques. Recently, various deep learning based solutions are being proposed one after another, and their performance is constantly improving. In this paper, we proposed a convolutional neural architecture by ensembling transfer learning based multi-channel attention networks. Here, four CNN branch… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 18 publications
(8 citation statements)
references
References 72 publications
0
3
0
Order By: Relevance
“…It depicts the accuracy results of different deep learningbased models with various benchmark datasets, namely HII [27], HIIv2 [6] and our H2HId dataset. Among all the models considered for evaluation, the proposed model attained the topmost accuracy of 96.38% on the HII dataset, followed by the EfficientNet [22] and EnsembleNet [33] models with 95.83% and 95.37% accuracy, respectively. Our method demonstrated high classification performance on the two benchmark datasets, namely the HII and HIIv2, yielding remarkable accuracies.…”
Section: Results Analysismentioning
confidence: 99%
“…It depicts the accuracy results of different deep learningbased models with various benchmark datasets, namely HII [27], HIIv2 [6] and our H2HId dataset. Among all the models considered for evaluation, the proposed model attained the topmost accuracy of 96.38% on the HII dataset, followed by the EfficientNet [22] and EnsembleNet [33] models with 95.83% and 95.37% accuracy, respectively. Our method demonstrated high classification performance on the two benchmark datasets, namely the HII and HIIv2, yielding remarkable accuracies.…”
Section: Results Analysismentioning
confidence: 99%
“…Lastly, the feature maps which is extracted from 4 branches concatenated, and put into fully-connected network for producing the final recognition output. Hirooka et al [12] offer a new hybrid DL network for HAR which uses multi-modal sensor data, but this presented method was ConvLSTM pipeline which completely uses the data in every layer derived from the temporal domain. At last, a fully-connected layer and a softmax function were utilized for computing the probability of every class.…”
Section: Related Workmentioning
confidence: 99%
“…To perform the activity recognition tasks from still images, some of the research focuses on using convolutional neural architectures which ensemble transfer learning-based multi-channel attention networks. The architecture has four CNN branches for feature fusion-based ensembling, with each branch having an attention module to extract contextual information from the pre-trained model's feature map [28]. In this paper, we leverage the advantages of transfer learning to propose a more accurate and less computationally complex sub-transfer learning model to boost the classification accuracies of the outlier subjects.…”
Section: Related Workmentioning
confidence: 99%