2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016
DOI: 10.1109/cvpr.2016.214
|View full text |Cite
|
Sign up to set email alerts
|

Learning Activity Progression in LSTMs for Activity Detection and Early Detection

Abstract: In this work we improve training of temporal deep models to better learn activity progression for activity detection and early detection tasks. Conventionally, when training a Recurrent Neural Network, specifically a Long Short Term Memory (LSTM) model, the training loss only considers classification error. However, we argue that the detection score of the correct activity category, or the detection score margin between the correct and incorrect categories, should be monotonically non-decreasing as the model o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
269
0
1

Year Published

2018
2018
2019
2019

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 359 publications
(272 citation statements)
references
References 23 publications
2
269
0
1
Order By: Relevance
“…This implies that our proposed soft regression framework is also beneficial for the task of action recognition and can obtain the state-of-the-art recognition result. As expected, our soft regression model outperformed the RankLSTM [36] and DeepSCN [25] approaches again, which demonstrates the efficacy of our soft label learning framework for early action prediction. We also note that the prediction results of most methods on the first 10% frames on this set is much lower than that on the ORGBD and SYSU 3D HOI sets.…”
Section: Results On the Ntu Large Scale Datasetmentioning
confidence: 50%
See 3 more Smart Citations
“…This implies that our proposed soft regression framework is also beneficial for the task of action recognition and can obtain the state-of-the-art recognition result. As expected, our soft regression model outperformed the RankLSTM [36] and DeepSCN [25] approaches again, which demonstrates the efficacy of our soft label learning framework for early action prediction. We also note that the prediction results of most methods on the first 10% frames on this set is much lower than that on the ORGBD and SYSU 3D HOI sets.…”
Section: Results On the Ntu Large Scale Datasetmentioning
confidence: 50%
“…and RankLSTM [36] with the same inputs, which demonstrates the effectiveness of our model for predicting early actions.…”
Section: Results On Online Rgb-d Action Datasetsmentioning
confidence: 72%
See 2 more Smart Citations
“…The exhaustive computation of video classifiers (sliding window) has been avoided in [11][12][13][14][15][16]. In those methods, first a number of candidate segments containing human actions, known as action proposals, are produced.…”
mentioning
confidence: 99%