2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00889
|View full text |Cite
|
Sign up to set email alerts
|

Two-Stream Action Recognition-Oriented Video Super-Resolution

Abstract: We study the video super-resolution (SR) problem for facilitating video analytics tasks, e.g. action recognition, instead of for visual quality. The popular action recognition methods based on convolutional networks, exemplified by two-stream networks, are not directly applicable on video of low spatial resolution. This can be remedied by performing video SR prior to recognition, which motivates us to improve the SR procedure for recognition accuracy. Tailored for two-stream action recognition networks, we pro… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
2
1

Relationship

1
9

Authors

Journals

citations
Cited by 51 publications
(21 citation statements)
references
References 45 publications
0
21
0
Order By: Relevance
“…Considering that this is not flexible for HAR, Wang [109] introduced an end-to-end framework that is capable of taking videos of arbitrary sizes and lengths as inputs. To address the problem of HAR using videos at low spatial resolutions, Zhang et al [118] studied the super-resolution problem of video for HAR.…”
Section: Deep Learning Methodsmentioning
confidence: 99%
“…Considering that this is not flexible for HAR, Wang [109] introduced an end-to-end framework that is capable of taking videos of arbitrary sizes and lengths as inputs. To address the problem of HAR using videos at low spatial resolutions, Zhang et al [118] studied the super-resolution problem of video for HAR.…”
Section: Deep Learning Methodsmentioning
confidence: 99%
“…In video deblurring, [26] estimated the segmentation probability map first and finished deblurring via the segmentation prior. To improve classification, [28] enhanced the target image and [37] increased the resolution of the input image, and [32] restored the degraded images to clean images in the feature domain of a classifier. These works above focused on the performance of only one task.…”
Section: Synergy Between High-level and Low-level Vision Tasksmentioning
confidence: 99%
“…Generally, most of two-stream type works ( [36], [37], [38]) learn to fuse two outputs of this network. Indeed, we can directly add the connection between outputs of IENet and RENet.…”
Section: B Context-sensitive Decomposition Connectionmentioning
confidence: 99%