2016
DOI: 10.1016/j.neucom.2016.08.032
|View full text |Cite
|
Sign up to set email alerts
|

Weakly supervised activity analysis with spatio-temporal localisation

Abstract: In computer vision, an increasing number of weakly annotated videos have become available, due to the fact it is often difficult and time consuming to annotate all the details in the videos collected. Learning methods that analyse human activities in weakly annotated video data have gained great interest in recent years. They are categorised as "weakly supervised learning", and usually form a multi-instance multi-label (MIML) learning problem. In addition to the commonly known difficulties of MIML learning, i.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 41 publications
0
2
0
Order By: Relevance
“…Moreover, some recent works tend to explore object-based representations without using any explicit semantic object segmentation techniques, which seeks a synergy between the MIML and object-level representations. [11] address this weakly supervised issue in multi-label human action detection with a two-stage solution. First, a set of potential objects or spatial-temporal volumes are generated and selected from a video instance with a set of handcrafted rules.…”
Section: Multi-label Learningmentioning
confidence: 99%
See 1 more Smart Citation
“…Moreover, some recent works tend to explore object-based representations without using any explicit semantic object segmentation techniques, which seeks a synergy between the MIML and object-level representations. [11] address this weakly supervised issue in multi-label human action detection with a two-stage solution. First, a set of potential objects or spatial-temporal volumes are generated and selected from a video instance with a set of handcrafted rules.…”
Section: Multi-label Learningmentioning
confidence: 99%
“…( 8) and ( 9) or the regularized hinge rank loss functions in Eqs. (10) and (11) to train visual and semantic embedding models in our framework.…”
Section: Rank Loss Functionmentioning
confidence: 99%