2024
DOI: 10.3390/rs16010196
|View full text |Cite
|
Sign up to set email alerts
|

Cross-Modal Retrieval and Semantic Refinement for Remote Sensing Image Captioning

Zhengxin Li,
Wenzhe Zhao,
Xuanyi Du
et al.

Abstract: Two-stage remote sensing image captioning (RSIC) methods have achieved promising results by incorporating additional pre-trained remote sensing tasks to extract supplementary information and improve caption quality. However, these methods face limitations in semantic comprehension, as pre-trained detectors/classifiers are constrained by predefined labels, leading to an oversight of the intricate and diverse details present in remote sensing images (RSIs). Additionally, the handling of auxiliary remote sensing … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
references
References 43 publications
(84 reference statements)
0
0
0
Order By: Relevance