2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00238
|View full text |Cite
|
Sign up to set email alerts
|

SFNet: Learning Object-Aware Semantic Correspondence

Abstract: We address the problem of semantic correspondence, that is, establishing a dense flow field between images depicting different instances of the same object or scene category. We propose to use images annotated with binary foreground masks and subjected to synthetic geometric deformations to train a convolutional neural network (CNN) for this task. Using these masks as part of the supervisory signal offers a good compromise between semantic flow methods, where the amount of training data is limited by the cost … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
191
0
2

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 149 publications
(193 citation statements)
references
References 44 publications
0
191
0
2
Order By: Relevance
“…Argmax . The argmax function [ 38 ] of each pixel-bit depth vector is used to decompose the predicted values into segmentation masks and to get the enhancement result. The enhancement method based on WCF is then employed to improve the segmentation result and better capture the object boundaries.…”
Section: Methodsmentioning
confidence: 99%
“…Argmax . The argmax function [ 38 ] of each pixel-bit depth vector is used to decompose the predicted values into segmentation masks and to get the enhancement result. The enhancement method based on WCF is then employed to improve the segmentation result and better capture the object boundaries.…”
Section: Methodsmentioning
confidence: 99%
“…In other words, considering context helps to improve the performance of semantic matching. To this end, a direct way is to explicitly add neighborhood continuity constraints to loss, such as smoothness and geometric consistency constraints [34,35]. Another strategy is to consider the spatial context when extracting semantic features so that the features can perceive local information.…”
Section: Spatial Context For Semantic Matchingmentioning
confidence: 99%
“…Instead of using the foreground-mask correspondences as supervision signals [34], our method uses keypoint labels since they have pixel-level ground-truth matches. This stronger supervision signal can guide the network to estimate the matching field between images.…”
Section: Objective Functionmentioning
confidence: 99%
“…Wang et al [20] used an unsupervised method to learn feature representations for identifying correspondences across frames. Lee et al [21] attempted to derive semantic correspondences by objectaware losses. Compared with pixel-level sharing, featurelevel sharing is learned by an end-to-end process, and it is thus difficult to directly evaluate its performance.…”
Section: B Semantics Sharingmentioning
confidence: 99%