2018
DOI: 10.1109/tgrs.2017.2776321

Exploring Models and Data for Remote Sensing Image Caption Generation

Cited by 380 publications (283 citation statements)
References 46 publications
“…The compression methods for multisource image/video data are designed from the perspective of image features, which usually mine similarities between image blocks by matching feature points. Moreover, multiscale features for image representation are proposed to extend representation from single payload to multiple payloads, as being proposed in References [35][36][37][38], which is also a way to build relations between multiple data sources. However, computational complexity is high, and the actual correspondence between the selected image block and the coding object is often lacking, which is not conducive to large-area matching.…”
Section: Video Compression Of Multisource Image/video Data (mentioning)
confidence: 99%
“…To follow the direction of scene caption, a well-annotated scene caption dataset is also necessary. Researchers have presented a few exemplary works on remote sensing image caption [23,24], and have constructed a large-scale dataset under specific annotated instructions in consideration of characteristics of remote sensing images, e.g., not using words that represent the concept of "direction" and "vague". We believe that the scene caption will be a new chance to generate better description of scenes in remote sensing images and will receive more concerns from remote sensing community.…”
Section: Better Describing the Content Of Scenes (mentioning)
confidence: 99%
“…With the development of deep learning on computer vision, scene understanding [9,10,11,12,13,14] achieves a remarkable progress. At present, CNN-based methods [15,16] attain the significant performance for crowd counting.…”
Section: Introduction (mentioning)
confidence: 99%