2020
DOI: 10.3390/app10041531
|View full text |Cite
|
Sign up to set email alerts
|

Body-Part-Aware and Multitask-Aware Single-Image-Based Action Recognition

Abstract: Action recognition is an application that, ideally, requires real-time results. We focus on single-image-based action recognition instead of video-based because of improved speed and lower cost of computation. However, a single image contains limited information, which makes single-image-based action recognition a difficult problem. To get an accurate representation of action classes, we propose three feature-stream-based shallow sub-networks (image-based, attention-image-based, and part-image-based feature ne… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(6 citation statements)
references
References 35 publications
0
6
0
Order By: Relevance
“…[20] implemented simple image-based action recognition based on an HRNet [21] human pose estimation network. HRNet [21] represents multitasking features with a set of feature maps extracted from an image in the decreasing order of resolution and increasing order of channels [20].…”
Section: Single Image-based Action Recognition Bhandari Et Almentioning
confidence: 99%
See 2 more Smart Citations
“…[20] implemented simple image-based action recognition based on an HRNet [21] human pose estimation network. HRNet [21] represents multitasking features with a set of feature maps extracted from an image in the decreasing order of resolution and increasing order of channels [20].…”
Section: Single Image-based Action Recognition Bhandari Et Almentioning
confidence: 99%
“…Attention Image-Based Feature Stream. Bhandari et al [20] and Fukui et al [22] used attention mechanisms such as attention-based image feature extraction streams foreground analysis. is method uses an attention image-based feature stream [23], which is built on top of ResNet18 [24].…”
Section: Single Image-based Action Recognition Bhandari Et Almentioning
confidence: 99%
See 1 more Smart Citation
“…As reported in Table 6, SCLAR yields the best performance in terms of accuracy and efficiency in comparison with state-of-the-art works. More specifically, as a runner-up, SCLAR yields gains of 0.41% and 0.66% on mAP compared to the second [19] and third [54] best methods. While the obtained gains in respect to the second-best method [19] is not significantly pronounced, the efficiency gain is dramatically considerable.…”
Section: Stanford40mentioning
confidence: 99%
“…Yet, [19] and [54] require additional supervision, such as human bounding boxes or predefined body parts, which confine their practical applications to specific domains with available bounding boxes. Besides, it is proven that SCLAR outperforms other studies [12,[20][21][22] to a great degree.…”
Section: Stanford40mentioning
confidence: 99%