2015 International Conference on Image Processing Theory, Tools and Applications (IPTA) 2015
DOI: 10.1109/ipta.2015.7367098
|View full text |Cite
|
Sign up to set email alerts
|

Deep learning based super-resolution for improved action recognition

Abstract: Action recognition systems mostly work with videos of proper quality and resolution. Even most challenging benchmark databases for action recognition, hardly include videos of low-resolution from, e.g., surveillance cameras. In videos recorded by such cameras, due to the distance between people and cameras, people are pictured very small and hence challenge action recognition algorithms. Simple upsampling methods, like bicubic interpolation, cannot retrieve all the detailed information that can help the recogn… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 22 publications
(11 citation statements)
references
References 26 publications
0
11
0
Order By: Relevance
“…Immediate effort is also expected in action/gesture localization in long, untrimmed, and realistic videos [128,34,95]. As such, we envision newer problems like early recognition [28], multi-task learning [127], captioning, recognition from low resolution sequences [66] and lifelog devices [87] will receive attention in the next years.…”
Section: Discussionmentioning
confidence: 99%
“…Immediate effort is also expected in action/gesture localization in long, untrimmed, and realistic videos [128,34,95]. As such, we envision newer problems like early recognition [28], multi-task learning [127], captioning, recognition from low resolution sequences [66] and lifelog devices [87] will receive attention in the next years.…”
Section: Discussionmentioning
confidence: 99%
“…Additional effort is expected to advance in the research of methods able to simultaneously perform both detection and recognition tasks in long, realistic videos (Gkioxari and Malik, 2015;Shou et al, 2016b). As such, we envision other related problems like early recognition Escalante et al (2016a), multi task learning , captioning, recognition from low resolution sequences Nasrollahi et al (2015) and from lifelog devices Rhinehart and Kitani (2016) will receive special attention within the next few years.…”
Section: Future Workmentioning
confidence: 99%
“…Skeleton tracking has become much easier with the appearance of motion capture systems, which automatically generate the human skeleton represented by 3-dimensional (3D) coordinates. Additionally, it brought up an increase of research on body movement, such as unusual event detection and crime prevention [ 16 , 17 , 18 , 19 , 20 ].…”
Section: Introductionmentioning
confidence: 99%