2019 IEEE Winter Conference on Applications of Computer Vision (WACV) 2019
DOI: 10.1109/wacv.2019.00019
|View full text |Cite
|
Sign up to set email alerts
|

Still Image Action Recognition by Predicting Spatial-Temporal Pixel Evolution

Abstract: We propose a novel approach based on deep Convolutional Neural Networks (CNN) to recognize human actions in still images by predicting the future motion, and detecting the shape and location of the salient parts of the image. We make the following major contributions to this important area of research: (i) We use the predicted future motion in the static image (Walker et al., 2015) as a means of compensating for the missing temporal information, while using the saliency map to represent the the spatial informa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
3

Relationship

1
9

Authors

Journals

citations
Cited by 19 publications
(9 citation statements)
references
References 103 publications
0
9
0
Order By: Relevance
“…Action Recognition from a Single Image. One direction for action recognition is purely based on a single image [10,56,40,5]. In [10], multiple small objects are first identified in a still image and then the target action is inferred from the relationship among the objects.…”
Section: Related Workmentioning
confidence: 99%
“…Action Recognition from a Single Image. One direction for action recognition is purely based on a single image [10,56,40,5]. In [10], multiple small objects are first identified in a still image and then the target action is inferred from the relationship among the objects.…”
Section: Related Workmentioning
confidence: 99%
“…Khan et al [26] proposed an action-specific person detection based on transfer learning to improve the quality of bounding boxes. Recently, deep learning has produced successful results in single-image-based action recognition [26][27][28][29][30]. Zhang et al [28] proposed action recognition with only action labels, which learns the action masks and extracts the features from the objects in an action.…”
Section: Single-image-based Action Recognitionmentioning
confidence: 99%
“…Still image action recognition has recently benefited from the outstanding performance of CNN models (Gao, Xiong, and Grauman 2018;Gkioxari, Girshick, and Malik 2015;Safaei and Foroosh 2019;Hoai 2014;Oquab et al 2014;Rahman and Wang 2016). The tradeoff is the need for millions of parameters and dependency on huge training sets.…”
Section: Related Workmentioning
confidence: 99%