Unseen Object Segmentation in Videos via Transferable Representations

Chen, Yi Wen; Tsai, Yi Hsuan; Yang, Chu Ya; Lin, Yen-Yu; Yang, Ming-Hsuan

doi:10.1007/978-3-030-20870-7_38

Cited by 1 publication

(1 citation statement)

References 44 publications

(84 reference statements)

“…For that reason, the well-trained models on one dataset do not perform necessarily well when applied to another. As is stated in [21,3,30] The issue of degeneration in VOS models has existed for a while, especially for off-line methods where Figure 1: Predictions between the model trained with and without our UDA method on FBMS59 [20] and Youtube-Object [24]. The optical flow (left) indicate the motion of objects.…”

Section: Introduction (mentioning)

confidence: 94%

DAVOS: Semi-Supervised Video Object Segmentation via Adversarial Domain Adaptation

Zhang, Wang, Zhang et al. 2021

Preprint

Domain shift has always been one of the primary issues in video object segmentation (VOS), for which models suffer from degeneration when tested on unfamiliar datasets. Recently, many online methods have emerged to narrow the performance gap between training data (source domain) and test data (target domain) by fine-tuning on annotations of test data which are usually in shortage. In this paper, we propose a novel method to tackle domain shift by first introducing adversarial domain adaptation to the VOS task, with supervised training on the source domain and unsupervised training on the target domain. By fusing appearance and motion features with a convolution layer, and by adding supervision onto the motion branch, our model achieves state-of-the-art performance on DAVIS2016 with 82.6% mean IoU score after supervised training. Meanwhile, our adversarial domain adaptation strategy significantly raises the performance of the trained model when applied on FBMS59 and Youtube-Object, without exploiting extra annotations.
