2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00563
|View full text |Cite
|
Sign up to set email alerts
|

Temporal Recurrent Networks for Online Action Detection

Abstract: Most work on temporal action detection is formulated as an offline problem, in which the start and end times of actions are determined after the entire video is fully observed. However, important real-time applications including surveillance and driver assistance systems require identifying actions as soon as each video frame arrives, based only on current and historical observations. In this paper, we propose a novel framework, Temporal Recurrent Network (TRN), to model greater temporal context of a video fra… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
219
1

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
3

Relationship

3
5

Authors

Journals

citations
Cited by 159 publications
(220 citation statements)
references
References 55 publications
0
219
1
Order By: Relevance
“…Temporal Action Localization in long videos is widely studied in both offline and online scenarios. In the offline setting, temporal action detectors (Shou et al, 2016;Buch et al, 2017;Gao et al, 2017c;Chao et al, 2018) predict the start and end times of actions after observing the whole video, while online approaches (De Geest et al, 2016;Gao et al, 2017b;Shou et al, 2018b;Xu et al, 2018;Gao et al, 2019) label action class in a per-frame manner without accessing future information. The goal of temporal action detectors is to localize actions in pre-defined categories.…”
Section: Related Workmentioning
confidence: 99%
“…Temporal Action Localization in long videos is widely studied in both offline and online scenarios. In the offline setting, temporal action detectors (Shou et al, 2016;Buch et al, 2017;Gao et al, 2017c;Chao et al, 2018) predict the start and end times of actions after observing the whole video, while online approaches (De Geest et al, 2016;Gao et al, 2017b;Shou et al, 2018b;Xu et al, 2018;Gao et al, 2019) label action class in a per-frame manner without accessing future information. The goal of temporal action detectors is to localize actions in pre-defined categories.…”
Section: Related Workmentioning
confidence: 99%
“…Gao et al [12] propose a Reinforced Encoder-Decoder network for action anticipation and treat online action detection as a special case of their framework. Temporal Recurrent Networks [38] set a new state-of-the-art performance by conducting current and future action detection jointly. With the same goal of online per-frame labeling, these methods can serve as ClsNet in our framework.…”
Section: Related Workmentioning
confidence: 99%
“…Inspired by recent online action detection methods [9,12,38], we utilize recurrent networks, specifically, LSTM [19], to construct ClsNet. At each time t, it uses the previous hid-…”
Section: Classification Network (Clsnet)mentioning
confidence: 99%
See 1 more Smart Citation
“…Thus, our method fuses the information from both parts. Due to the good performance of recurrent networks [26], [27], [9] on online action detection tasks, our framework is based on LSTM [15].…”
Section: Model Descriptionmentioning
confidence: 99%