2021
DOI: 10.21203/rs.3.rs-1179245/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Aircraft Detection in Remote Sensing Image based on Multi-scale Convolution Neural Network with Attention Mechanism

Abstract: Detecting aircraft from remote sensing image (RSI) is an important but challenging task due to the variations of aircraft type, size, pose, angle, complex background and small size of aircraft in RSIs. An aircraft detection method is proposed based on multi-scale convolution neural network with attention (MSCNNA), consisting of encoder, decoder, attention and classification. In MSCNNA, the multiple convolutional and pooling kernels with different sizes are utilized to learn the multi-scale discriminant feature… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 28 publications
0
4
0
Order By: Relevance
“…To preserve the original rating information and improve the structure of the model stability and training efficiency, we normalized the sum of Self-attention and Rating. The residual network can maintain the performance of the network model with an increase in the number of network layers and then slow down the degradation of the model performance [48].…”
Section: Self-attention Modulementioning
confidence: 99%
“…To preserve the original rating information and improve the structure of the model stability and training efficiency, we normalized the sum of Self-attention and Rating. The residual network can maintain the performance of the network model with an increase in the number of network layers and then slow down the degradation of the model performance [48].…”
Section: Self-attention Modulementioning
confidence: 99%
“…In the decoding phase, the decoder receives the feature map from the encoder. Through the Transformer module and the upsampling stage, the decoder conducts decoding operations while also performing skip connections [18] . Ultimately, in the final layer of the network structure, the decoder outputs a denoised image of size 512×512.…”
Section: Denoising Network Structurementioning
confidence: 99%
“…With the increase in GPU computing power and the research of neural networks in recent years, object detection has become a hot spot in global artificial intelligence research. Most of the current mainstream object detection methods are based on convolutional neural networks, and in recent years, two main categories have been formed: candidate region-based and regression-based [ 18 ]. Candidate region-based object detection methods, also known as two-stage methods, divide the object detection problem into two stages; one is to generate candidate regions, and the other is to put the candidate regions into the classifier to classify and correct the position.…”
Section: Related Workmentioning
confidence: 99%