2022
DOI: 10.1109/jstars.2022.3158903
|View full text |Cite
|
Sign up to set email alerts
|

Learning Calibrated-Guidance for Object Detection in Aerial Images

Abstract: Object detection is one of the most fundamental yet challenging research topics in the domain of computer vision. Recently, the study on this topic in aerial images has made tremendous progress. However, complex background and worse imaging quality are obvious problems in aerial object detection. Most state-of-the-art approaches tend to develop elaborate attention mechanisms for the space-time feature calibrations with arduous computational complexity, while surprisingly ignoring the importance of feature cali… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
9
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
10

Relationship

0
10

Authors

Journals

citations
Cited by 31 publications
(9 citation statements)
references
References 59 publications
(130 reference statements)
0
9
0
Order By: Relevance
“…Another line of research focuses on feature alignment in the semantic dimension. CGNet [ 39 ] uses self-attention to enhance communication between pyramid levels. AFF-Det [ 14 , 18 ] maps ROIs to all levels and applies a unified supervisory signal to alleviate the semantic gap.…”
Section: Methodsmentioning
confidence: 99%
“…Another line of research focuses on feature alignment in the semantic dimension. CGNet [ 39 ] uses self-attention to enhance communication between pyramid levels. AFF-Det [ 14 , 18 ] maps ROIs to all levels and applies a unified supervisory signal to alleviate the semantic gap.…”
Section: Methodsmentioning
confidence: 99%
“…As a result of the tiny scale of small targets, the prediction of centroids is useful for localizing small targets and is a viable approach for small target identification. Transformer was used by WEI et al [3] to detect small targets, and they also put forward the CG-Net (Calibrated-Guidance) idea, which improves the connectivity between channels using the Transformer function. Extracting rich contexts from the many windows that were generated would help train improved target representation.…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, tiny object detection has become one of the most challenging tasks in computer vision [32]. In allusion to the smaller size and higher density of the objects in the aerial images, Wei et al proposed an efficacious calibrated-guidance (CG) scheme to intensify the channel communication through the feature transformer fashion, which could adaptively determine the calibration weights for each channel based on the global feature affinity correlations [33]. The concept of fusion factor was proposed by Gong et al to control the information that delivered from deep layers to the shallow ones, which adapted the feather pyramid network (FPN) to tiny object detection, and its effective value was estimated based on a statistical method [34].…”
Section: Introductionmentioning
confidence: 99%