2022 IEEE International Conference on Multimedia and Expo (ICME) 2022
DOI: 10.1109/icme52920.2022.9859701
Improving Image Paragraph Captioning with Dual Relations

Cited by 7 publications (1 citation statement)
References 20 publications
“…DAM model determines the order of images based on spatial locations so that getting rid of verbose on the same object. Wang et al [116] designed Convolutional Auto-Encoding (CAE) networks to model the topics on region-level features, and further feed these topic vectors into a two-layer LSTM network. Liu et al [117] proposed DuelRel model to capture both spatial and semantic relationships, where spatial relations are acquired from a geometry pattern and semantic relations are modeled in a weakly supervised manner.…”
Section: Other Generation Related Tasks (mentioning)
confidence: 99%