2021
DOI: 10.1109/jstars.2021.3091134
|View full text |Cite
|
Sign up to set email alerts
|

Multilabel Remote Sensing Image Annotation With Multiscale Attention and Label Correlation

Abstract: Deep learning based multi-label image annotation is receiving increasing attention in the field of remote sensing due to the great success of deep networks in single-label remote sensing image classification. Compared with those low-level features, the features extracted by the convolutional neural network (CNN) are more informative and can alleviate the problem of semantic gap. However, the CNN model tends to ignore the smaller objects when objects of different sizes exist in an image. In addition, how to eff… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
13
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 23 publications
(13 citation statements)
references
References 41 publications
0
13
0
Order By: Relevance
“…The conventional feature representation methods are based on low-level visual features [41,42], which are not discriminative enough to represent complex image content. Recently, many learning-based methods [43][44][45][46] have achieved great success by representing high-level visual features of images. For instance, Xiong et al [4] utilize the attention mechanism and multi-task learning strategy to improve the discriminant capabilities of the model.…”
Section: B Feature Representation For Aerial Imagementioning
confidence: 99%
See 1 more Smart Citation
“…The conventional feature representation methods are based on low-level visual features [41,42], which are not discriminative enough to represent complex image content. Recently, many learning-based methods [43][44][45][46] have achieved great success by representing high-level visual features of images. For instance, Xiong et al [4] utilize the attention mechanism and multi-task learning strategy to improve the discriminant capabilities of the model.…”
Section: B Feature Representation For Aerial Imagementioning
confidence: 99%
“…To preserve the scene-level similarity relationships of images, attention mechanism and skip-layer connection strategy are combined to produce discriminative features for object-level aerial images annotation [45]. Huang et al [46] fuse the multiscale features from different layers and hence introduce an attention model to increase the discriminative ability of feature representation. However, it is evident that these methods fail to extract the features with different grained levels.…”
Section: B Feature Representation For Aerial Imagementioning
confidence: 99%
“…The authors trained their model in an end-to-end manner where CNN layers are formed to generate mid-level features and RNN is used for learning contextual dependencies. Huang et al [36] proposed an end-to-end deep learning model with multiscale feature fusion, channel-spatial attention, and a label correlation extraction module. Specifically, a channel-spatial attention mechanism is used to fuse and refine multi-scale features from different layers of the CNN model.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Then it creates a sense for the assorted area of the pixels, which is really what the image segment wants to get. In this way, order is paramount during clinical picture commentary [4]. Therefore, in recent years, many researchers in this field have made relentless efforts to further improve the accuracy of the sequence, although the characterization of the images concept problem is indeed critical of many problems and difficulties [5].…”
Section: Introductionmentioning
confidence: 99%