2019
DOI: 10.3390/rs11070755
|View full text |Cite
|
Sign up to set email alerts
|

Geospatial Object Detection on High Resolution Remote Sensing Imagery Based on Double Multi-Scale Feature Pyramid Network

Abstract: Object detection on very-high-resolution (VHR) remote sensing imagery has attracted a lot of attention in the field of image automatic interpretation. Region-based convolutional neural networks (CNNs) have been vastly promoted in this domain, which first generate candidate regions and then accurately classify and locate the objects existing in these regions. However, the overlarge images, the complex image backgrounds and the uneven size and quantity distribution of training samples make the detection tasks mo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
48
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 74 publications
(48 citation statements)
references
References 54 publications
0
48
0
Order By: Relevance
“…To deal with the problem of multi-scale detection with the large ratio of remote sensing objects, Guo et al [26] proposed a unified multi-scale framework, which is composed of multi-scale object proposal network and a multi-scale detection network. To achieve further accuracy of the localization in aerial images, Zhang et al [27] proposed a Double Multi-scale Feature Pyramid Network (DM-FPN), which makes the most of semantic and resolution features simultaneously and bring up some multi-scale training, inference and adaptive categorical non-maximum suppression (ACNMS) strategies. In addition, object detection based on weakly supervised deep learning method arouses more and more attentions of researchers in recent years.…”
Section: Introductionmentioning
confidence: 99%
“…To deal with the problem of multi-scale detection with the large ratio of remote sensing objects, Guo et al [26] proposed a unified multi-scale framework, which is composed of multi-scale object proposal network and a multi-scale detection network. To achieve further accuracy of the localization in aerial images, Zhang et al [27] proposed a Double Multi-scale Feature Pyramid Network (DM-FPN), which makes the most of semantic and resolution features simultaneously and bring up some multi-scale training, inference and adaptive categorical non-maximum suppression (ACNMS) strategies. In addition, object detection based on weakly supervised deep learning method arouses more and more attentions of researchers in recent years.…”
Section: Introductionmentioning
confidence: 99%
“…[64] Used a classical Fuzzy C-means (FCM) method was for the coastline detection, but had been improved by combining the Wavelet decomposition algorithm to better suppress the inherent speckle noises of SAR image. In [65], [66] the authors proposed an end-to-end framework called multiple feature pyramid network (MFPN). In MFPN, an effective feature pyramid and a tailored pyramid pooling module are implemented that takes advantage of multilevel semantic features of high resolution remote sensing images.…”
Section: Introductionmentioning
confidence: 99%
“…On this basis, object detection in remote sensing imagery has been widely studied in recent years [27][28][29][30][31][32]. In the field of remote sensing, many researchers have made great efforts to object detection methods based on CNN [33][34][35][36][37][38][39].…”
Section: Introductionmentioning
confidence: 99%
“…Note that detectors in this model are used in conjunction with multi-scale feature fusion modules. Based on the SSD [25], a novel single-shot detector named the Recurrent Detection with Activated Semantics (RDAS) structure is presented for addressing the small-scaled object fast detection problem in VHR remote sensing in [35]. Besides, the shared multi-scale base network and the multi-scale object proposal network were employed in [38], which enables the production of feature maps with high semantic information at different layers and generation of anchor boxes that cover most of the objects with a small number of negative samples.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation