2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.01015
|View full text |Cite
|
Sign up to set email alerts
|

Action Recognition From Single Timestamp Supervision in Untrimmed Videos

Abstract: Recognising actions in videos relies on labelled supervision during training, typically the start and end times of each action instance. This supervision is not only subjective, but also expensive to acquire. Weak video-level supervision has been successfully exploited for recognition in untrimmed videos, however it is challenged when the number of different actions in training videos increases. We propose a method that is supervised by single timestamps located around each action instance, in untrimmed videos… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
49
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 59 publications
(49 citation statements)
references
References 31 publications
(58 reference statements)
0
49
0
Order By: Relevance
“…from [31]. This plateau function has three parameters and it is differentiable with respect to them: λ c controls the center of the plateau, λ w is the width and λ s is the sharpness of the plateau function.…”
Section: Approximate Differentiable Energy E *mentioning
confidence: 99%
“…from [31]. This plateau function has three parameters and it is differentiable with respect to them: λ c controls the center of the plateau, λ w is the width and λ s is the sharpness of the plateau function.…”
Section: Approximate Differentiable Energy E *mentioning
confidence: 99%
“…At present, a large number of researchers have been working in the field of video-based human action recognition, and have achieved many surprising results, 16 making the current human action recognition with very high recognition accuracy. Video-based human action recognition is mainly developed on the understanding of color video.…”
Section: Related Workmentioning
confidence: 99%
“…Video-based human action recognition is a very popular direction in the field of computer vision, 16 because it has been widely used in people's lives, such as video surveillance, human-computer interaction, video understand -ing, smart medicine and so on.In particular, the launch of the Kinect sensor camera by Microsoft has triggered a surge in the use of human skeleton information to analyze human action. Because of its powerful skeleton tracking function, Kinect enables researchers to analyze human action using human skeleton motion trajectories and has been very successful in many fields.…”
Section: Introductionmentioning
confidence: 99%
“…While transcript-level and set-level supervision significantly reduce the annotation effort, the performance is not satisfying and there is still a gap compared to fully supervised approaches. In this paper, inspired by the recently introduced timestamp supervision for action recognition [31], we propose to use timestamp supervision for the action segmentation task to address the limitations of the current weakly supervised approaches. For timestamp supervision, only one frame is annotated from each action segment as illustrated in Fig.…”
Section: Introductionmentioning
confidence: 99%