2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00170
|View full text |Cite
|
Sign up to set email alerts
|

Multi-label Zero-Shot Learning with Structured Knowledge Graphs

Abstract: In this paper, we propose a novel deep learning architecture for multi-label zero-shot learning (ML-ZSL), which is able to predict multiple unseen class labels for each input instance. Inspired by the way humans utilize semantic knowledge between objects of interests, we propose a framework that incorporates knowledge graphs for describing the relationships between multiple labels. Our model learns an information propagation mechanism from the semantic label space, which can be applied to model the interdepend… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
173
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 277 publications
(173 citation statements)
references
References 39 publications
0
173
0
Order By: Relevance
“…Quantitative results are reported in Table 1. We compare with state-of-the-art methods, including CNN-RNN [28], RNN-Attention [29], Order-Free RNN [1], ML-ZSL [15], SRN [36], Multi-Evidence [6], etc. For the proposed ML-GCN, we report the results based on the binary correlation matrix ("ML-GCN (Binary)") and the re-weighted correlation matrix ("ML-GCN (Re-weighted)"), respectively.…”
Section: Comparisons With State-of-the-artsmentioning
confidence: 99%
“…Quantitative results are reported in Table 1. We compare with state-of-the-art methods, including CNN-RNN [28], RNN-Attention [29], Order-Free RNN [1], ML-ZSL [15], SRN [36], Multi-Evidence [6], etc. For the proposed ML-GCN, we report the results based on the binary correlation matrix ("ML-GCN (Binary)") and the re-weighted correlation matrix ("ML-GCN (Re-weighted)"), respectively.…”
Section: Comparisons With State-of-the-artsmentioning
confidence: 99%
“…It has been extensively studied to incorporate prior knowledge to aid numerous vision tasks [20,8,15,7,2,18]. For example, Marino et al [20] constructed a knowledge graph based on the WordNet [22] and the Visual Genome dataset [14], and learned the representation of this graph to enhance image feature representation to promote multilabel recognition.…”
Section: Knowledge Representationmentioning
confidence: 99%
“…Taking advantage of geometric deep learning [19,20], our Graphonomy simply integrates two cooperative modules for graph transfer learning. First, we introduce Intra-Graph Reasoning to progressively refine graph representations within the same graph structure, in which each graph node is responsible for segmenting out regions of one semantic part in a dataset.…”
Section: Inter-graph Connectionmentioning
confidence: 99%
“…Knowledge-guided Graph Reasoning. Many research efforts recently model domain knowledge as a graph for mining correlations among labels or objects in images, which has been proved effective in many tasks [5,19,20,29,35]. For example, Chen et al [5] leveraged local regionbased reasoning and global reasoning to facilitate object detection.…”
Section: Related Workmentioning
confidence: 99%