2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017
DOI: 10.1109/cvpr.2017.155
|View full text |Cite
|
Sign up to set email alerts
|

CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos

Abstract: Temporal action localization is an important yet challenging problem. Given a long, untrimmed video consisting of multiple action instances and complex background contents, we need not only to recognize their action categories, but also to localize the start time and end time of each instance. Many state-of-the-art systems use segmentlevel classifiers to select and rank proposal segments of predetermined boundaries. However, a desirable model should move beyond segment-level and make dense predictions at a fin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
395
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
4
2
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 578 publications
(396 citation statements)
references
References 68 publications
1
395
0
Order By: Relevance
“…Table 2 reports the action localization results of various methods. Regarding the average mAP, P-GCN outperforms SSN [52], CDC [33], and TAL-Net [6] by 3.01%, 3.19%, and 6.77%, respectively. We observe that the method by Lin et al [27] (called LIN below) performs promisingly on this dataset.…”
Section: Comparison With State-of-the-art Resultsmentioning
confidence: 98%
See 1 more Smart Citation
“…Table 2 reports the action localization results of various methods. Regarding the average mAP, P-GCN outperforms SSN [52], CDC [33], and TAL-Net [6] by 3.01%, 3.19%, and 6.77%, respectively. We observe that the method by Lin et al [27] (called LIN below) performs promisingly on this dataset.…”
Section: Comparison With State-of-the-art Resultsmentioning
confidence: 98%
“…Temporal Action localization has attracted increasing attention in the last several years [6,18,26,33,34]. Inspired by the success of object detection, most current action detection methods resort to the two-stage pipeline: they first generate a set of 1D temporal proposals and then perform classification and temporal boundary regression on each proposal individually.…”
Section: Introductionmentioning
confidence: 99%
“…In the future, we would consider integrating the early action prediction with action localization [44], [56] together to form a more complete early action analysis system.…”
Section: Discussionmentioning
confidence: 99%
“…Once the localization of action proposals completes, the natural way for temporal action detection is to further classify the proposals into known action classes, making the process in two-stage manner [4,12,31,32,40,45]. However, the separate of proposal generation and classification may result in sub-optimal solutions.…”
Section: Related Workmentioning
confidence: 99%