2022
DOI: 10.3390/rs14061319
|View full text |Cite
|
Sign up to set email alerts
|

Dual Modality Collaborative Learning for Cross-Source Remote Sensing Retrieval

Abstract: Content-based remote sensing (RS) image retrieval (CBRSIR) is a critical way to organize high-resolution RS (HRRS) images in the current big data era. The increasing volume of HRRS images from different satellites and sensors leads to more attention to the cross-source CSRSIR (CS-CBRSIR) problem. Due to the data drift, one crucial problem in CS-CBRSIR is the modality discrepancy. Most existing methods focus on finding a common feature space for various HRRS images to address this issue. In this space, their si… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 67 publications
0
0
0
Order By: Relevance
“…Spatial attention weight coefficients are generated using the sigmoid function and multiplied with the input feature map to implement the weighted operation. The calculation process for the feature map size in the convolutional layer is summarized in ( 5) and ( 6), while that of the spatial attention submodule is summarized in (7):…”
Section: Input Features F1 Output Features F3 Channel Attention Modul...mentioning
confidence: 99%
See 1 more Smart Citation
“…Spatial attention weight coefficients are generated using the sigmoid function and multiplied with the input feature map to implement the weighted operation. The calculation process for the feature map size in the convolutional layer is summarized in ( 5) and ( 6), while that of the spatial attention submodule is summarized in (7):…”
Section: Input Features F1 Output Features F3 Channel Attention Modul...mentioning
confidence: 99%
“…Low-level features such as SIFT [4], LBP [5], and HOG [6] provide typical examples. Low-level features describe local image representation and are aggregated to form mid-level features using descriptor aggregation techniques such as BoW [7], VLAD [8], FK [9], and EMK [10]. With the development of deep learning technology and the introduction of image retrieval, convolutional neural networks (CNNs) [11] are typically used as feature extractors to obtain abstract features of remote sensing images [12], referred to as high-level features.…”
Section: Introductionmentioning
confidence: 99%