2018
DOI: 10.48550/arxiv.1806.02964
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

BSN: Boundary Sensitive Network for Temporal Action Proposal Generation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
44
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
2

Relationship

3
5

Authors

Journals

citations
Cited by 26 publications
(44 citation statements)
references
References 26 publications
0
44
0
Order By: Relevance
“…However, the performance of the employed methods was limited by the hand-crafted feature representations. Recently, deep networks were applied to action localization to achieve improved performance [28], [29], [30], [31], [13], [32]. In S-CNN [28], multi-stage CNN that utilizes 3D-ConvNet (C3D) [9] is used to simultaneously capture the spatial and temporal features.…”
Section: B Temporal Action Localizationmentioning
confidence: 99%
“…However, the performance of the employed methods was limited by the hand-crafted feature representations. Recently, deep networks were applied to action localization to achieve improved performance [28], [29], [30], [31], [13], [32]. In S-CNN [28], multi-stage CNN that utilizes 3D-ConvNet (C3D) [9] is used to simultaneously capture the spatial and temporal features.…”
Section: B Temporal Action Localizationmentioning
confidence: 99%
“…Improved-BSN. BSN [7] also generates the proposals in a bottom-up fashion which first evaluates the probabilities of each temporal location being in the starting, ending and middling regions. Then through combining the high probability boundary locations, it can generate abundant proposals with flexible durations and confidence scores.…”
Section: Multi-granularity Fusion Networkmentioning
confidence: 99%
“…However, the pro-posals generated in a top-down fashion are doomed to have imprecise boundaries though with regression. Under this circumstance, the other type of methods [7,11] have drawn much attention in the community recently which tackle this problem in a bottom-up fashion, where the input video is evaluated in a finer-level. [7] is a typical method in this type which proposes the Boundary Sensitive Network (BSN) to generate proposals with flexible durations and reliable confidence scores.…”
Section: Task Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…It leads to an important yet challenging task for video analysis: Temporal Action Localization (TAL), which requires to not only classify the untrimmed videos into specific categories accurately, but also locate the temporal boundaries of action instances precisely. Although substantial progress has been achieved on this task [41], [26], [39], [16], [6], [18], [10], [9], it is still limited for industrial applications due to the huge amount of temporal annotations used for training such a deep learning based model in a fully-supervised manner, which are laborintensive to annotate especially for a large-scale dataset. On the contrary, weak labels such as video-level labels are much easier to obtain, hence many current works try to handle this problem under weak supervision.…”
Section: Introductionmentioning
confidence: 99%