2018
DOI: 10.48550/arxiv.1809.00461
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

YouTube-VOS: Sequence-to-Sequence Video Object Segmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(7 citation statements)
references
References 25 publications
0
7
0
Order By: Relevance
“…We don't run the original PReMVOS method but the PReMVOS Fast-finetuned version (see Section 4.3), which can be evaluated on this larger dataset in a more reasonable amount of time. Our results are much better than [33], the only other method that has published results on this dataset. It is also better than all other methods that submitted results to the 2017 1st Large-scale Video Object Segmentation Challenge.…”
Section: Further Large-scale Evaluationmentioning
confidence: 60%
See 1 more Smart Citation
“…We don't run the original PReMVOS method but the PReMVOS Fast-finetuned version (see Section 4.3), which can be evaluated on this larger dataset in a more reasonable amount of time. Our results are much better than [33], the only other method that has published results on this dataset. It is also better than all other methods that submitted results to the 2017 1st Large-scale Video Object Segmentation Challenge.…”
Section: Further Large-scale Evaluationmentioning
confidence: 60%
“…In Table 6 we present the results of PReMVOS on the new YouTube-VOS dataset [33] our results on the test set obtained 1st place in the 1st Large-scale Video Object Segmentation Challenge. We don't run the original PReMVOS method but the PReMVOS Fast-finetuned version (see Section 4.3), which can be evaluated on this larger dataset in a more reasonable amount of time.…”
Section: Further Large-scale Evaluationmentioning
confidence: 99%
“…( 7)) for temporally coherent predictions. We use videos in the Youtube-VOS dataset [29] as ground-truth for the training. It is a large-scale dataset for video object segmentation containing 4000+ YouTube videos with 70+ common objects.…”
Section: Two-stage Trainingmentioning
confidence: 99%
“…In the context of the video object removal task, masks with the most realistic appearance and motion can be obtained from video object segmentation datasets. We use the foreground segmentation masks of the YouTube-VOS dataset [29].…”
Section: Video Object Maskmentioning
confidence: 99%
“…The second set of approaches formulates segmentation as a foreground-background classification task, detecting regions that correspond to foreground objects and matching the resulting appearance models with other information such as salience maps, shape estimates, and pairwise constraints [32,22,46]. The third set of approaches incorporate the classification approach with a memory module for propagating region estimates in time [44,48]. The latter set of approaches begin to incorporate temporal dynamics information into the learning problem, and due to their success, motivate the method, observation, and dataset proposed in our work.…”
Section: Semantic Segmentation In Videomentioning
confidence: 99%