2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017
DOI: 10.1109/cvpr.2017.224
|View full text |Cite
|
Sign up to set email alerts
|

Budget-Aware Deep Semantic Video Segmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
36
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 50 publications
(36 citation statements)
references
References 27 publications
0
36
0
Order By: Relevance
“…Besides, temporal dynamics are also learnt using a sequential model like LSTM [52]. Moreover, LSTM is also used to select keyframes for video scene parsing [32]. Bowen Wang et al [48], proposed Noisy-LSTM which uses convLSTM for video semantic segmentation.…”
Section: Related Workmentioning
confidence: 99%
“…Besides, temporal dynamics are also learnt using a sequential model like LSTM [52]. Moreover, LSTM is also used to select keyframes for video scene parsing [32]. Bowen Wang et al [48], proposed Noisy-LSTM which uses convLSTM for video semantic segmentation.…”
Section: Related Workmentioning
confidence: 99%
“…The Flow-Track framework [45] combines historical feature flows, warping, consine similarity and temporal attention for efficient feature aggregation achieving an increased performance. Some techniques, primarily meant for semantic object segmentation [46]- [48], also utilized warped feature flow maps to aid the final stages of the segmentation pipeline.…”
Section: B Feature Aggregation Over Timementioning
confidence: 99%
“…Other techniques take advantage of the reinforcement learning framework for adaptive keyframe selection [48], [50], [61]. A policy-gradient reinforcement-learning approach [48] makes budget-aware processing by approximating the gradient of a non-decomposable and nondifferentiable objective. In [68], a learned policy is able to select when to update or to initiate a tracker.…”
Section: Adaptive Keyframe Selectionmentioning
confidence: 99%
See 1 more Smart Citation
“…Moreover, as a sequential learning network, recurrent neural network (RNN) [23] or long short‐term memory (LSTM) [24] is also utilised to adapt CNN‐based image segmentation methods to video semantic segmentation [8]. In [25], they propose a budget‐aware framework that learns to optimally select a small subset of frames for pixel‐wise labelling by a CNN, and then interpolates the obtained segmentations to unprocessed frames. However, the method first needs to use an image‐based method to segment some video frames.…”
Section: Related Workmentioning
confidence: 99%