2018
DOI: 10.1007/978-3-030-01228-1_36
|View full text |Cite
|
Sign up to set email alerts
|

YouTube-VOS: Sequence-to-Sequence Video Object Segmentation

Abstract: Learning long-term spatial-temporal features are critical for many video analysis tasks. However, existing video segmentation methods predominantly rely on static image segmentation techniques, and methods capturing temporal dependency for segmentation have to depend on pretrained optical flow models, leading to suboptimal solutions for the problem. End-to-end sequential learning to explore spatialtemporal features for video segmentation is largely limited by the scale of available video segmentation datasets,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
532
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 512 publications
(533 citation statements)
references
References 43 publications
1
532
0
Order By: Relevance
“…Datasets We evaluate our method on two video object segmentation datasets: Youtube-VOS [37] and DAVIS-2017 [25]. Training The network is trained using the objective function described in 3.4.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…Datasets We evaluate our method on two video object segmentation datasets: Youtube-VOS [37] and DAVIS-2017 [25]. Training The network is trained using the objective function described in 3.4.…”
Section: Methodsmentioning
confidence: 99%
“…Semi-supervised video object segmentation: Earlier works in video object segmentation used hand-crafted features based on appearance, boundary and optical flow [1,9,15,27,23]. The availability of large-scale video object segmentation datasets [25,37] enabled us to explore deep learning methods for this problem. Most of the early works are mainly motivated by the image segmentation methods [3,35,20].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Video object and instance segmentation problems have received significant attention [1,35,25] attributed to the recent availability of high-quality datasets, e.g., YouTube-VOS [45], DAVIS [29,30]. Given an input video, the aim is to separate the objects or instances from the background at the pixel-level.…”
Section: Introductionmentioning
confidence: 99%