2020
DOI: 10.1609/aaai.v34i07.6993
|View full text |Cite
|
Sign up to set email alerts
|

Zero-Shot Sketch-Based Image Retrieval via Graph Convolution Network

Abstract: Zero-Shot Sketch-based Image Retrieval (ZS-SBIR) has been proposed recently, putting the traditional Sketch-based Image Retrieval (SBIR) under the setting of zero-shot learning. Dealing with both the challenges in SBIR and zero-shot learning makes it become a more difficult task. Previous works mainly focus on utilizing one kind of information, i.e., the visual information or the semantic information. In this paper, we propose a SketchGCN model utilizing the graph convolution network, which simultaneously cons… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
37
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 57 publications
(37 citation statements)
references
References 19 publications
0
37
0
Order By: Relevance
“…Most existing ZS-SBIR methods follow the common space paradigm and pre-define class-prototypes with language models. Graph [Zhang et al, 2020;Shen et al, 2018], cycle consistency [Dutta and Akata, 2019;Deng et al, 2020] and content-style disentanglement [Dutta and Biswas, 2019]) are employed to learn the projection to map sketches/photos close to such prototypes. [Dutta and Akata, 2019] proposes a selection layer to refine the prototypes and reduce the dimensionality of retrieval features.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Most existing ZS-SBIR methods follow the common space paradigm and pre-define class-prototypes with language models. Graph [Zhang et al, 2020;Shen et al, 2018], cycle consistency [Dutta and Akata, 2019;Deng et al, 2020] and content-style disentanglement [Dutta and Biswas, 2019]) are employed to learn the projection to map sketches/photos close to such prototypes. [Dutta and Akata, 2019] proposes a selection layer to refine the prototypes and reduce the dimensionality of retrieval features.…”
Section: Related Workmentioning
confidence: 99%
“…[Dutta and Akata, 2019] proposes a selection layer to refine the prototypes and reduce the dimensionality of retrieval features. Graph-based method [Zhang et al, 2020] adjust the language-based adjacency matrix with visual information. proposes a teacherstudent framework to preserve discriminative representations from ImageNet and coordinates the representations close to language-based prototypes.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The utility of semantic space has been validated in various previously proposed frameworks, such as [9], [10], [19], [21], to name a few. The visual space is aligned with the semantic space to aid the training process and learn from the topology of the semantic space.…”
Section: Introductionmentioning
confidence: 99%
“…However, while [9], [10], [19] used the semantic space, they made the semantic space latent and learnable causing the network to eventually loose the classwise topology information as the network is trained for more number of epochs. To preserve the original topology of the semantic space, [21], [22] proposed using a graph convolution network (GCN) [23]. While in [22] the authors use a GCN directly on the semantic graph, in [21] the authors create a fully-connected graph whose edge weights correspond to the semantic distances and the node features comprise of classwise visual features.…”
Section: Introductionmentioning
confidence: 99%