2020
DOI: 10.48550/arxiv.2007.11731
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Comprehensive Image Captioning via Scene Graph Decomposition

Abstract: We address the challenging problem of image captioning by revisiting the representation of image scene graph. At the core of our method lies the decomposition of a scene graph into a set of subgraphs, with each sub-graph capturing a semantic component of the input image. We design a deep model to select important sub-graphs, and to decode each selected sub-graph into a single target sentence. By using sub-graphs, our model is able to attend to different components of the image. Our method thus accounts for acc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 65 publications
0
1
0
Order By: Relevance
“…‡ Yong Zhang, Baoyuan Wu and Yujiu Yang are the corresponding authors. as image captioning [33,42] and visual question answering [36]. Intuitively, the latter would get greater benefit from more human-like scene graphs.…”
Section: Introductionmentioning
confidence: 99%
“…‡ Yong Zhang, Baoyuan Wu and Yujiu Yang are the corresponding authors. as image captioning [33,42] and visual question answering [36]. Intuitively, the latter would get greater benefit from more human-like scene graphs.…”
Section: Introductionmentioning
confidence: 99%