2019
DOI: 10.3390/rs11242930
|View full text |Cite
|
Sign up to set email alerts
|

Mask OBB: A Semantic Attention-Based Mask Oriented Bounding Box Representation for Multi-Category Object Detection in Aerial Images

Abstract: Object detection in aerial images is a fundamental yet challenging task in remote sensing field. As most objects in aerial images are in arbitrary orientations, oriented bounding boxes (OBBs) have a great superiority compared with traditional horizontal bounding boxes (HBBs). However, the regression-based OBB detection methods always suffer from ambiguity in the definition of learning targets, which will decrease the detection accuracy. In this paper, we provide a comprehensive analysis of OBB representations … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
57
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 141 publications
(58 citation statements)
references
References 57 publications
1
57
0
Order By: Relevance
“…The second stage completes the three tasks of horizontal bounding box (HBB) prediction, oriented bounding box (OBB) prediction, and semantic segmentation. The HBB head predicts HBB representations of the bridges, while the OBB head firstly generates the instance masks and then converts them into OBB representations via the method proposed in Mask OBB [24]. The semantic head takes several fully convolutional layers to predict the water segmentation map in the image.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The second stage completes the three tasks of horizontal bounding box (HBB) prediction, oriented bounding box (OBB) prediction, and semantic segmentation. The HBB head predicts HBB representations of the bridges, while the OBB head firstly generates the instance masks and then converts them into OBB representations via the method proposed in Mask OBB [24]. The semantic head takes several fully convolutional layers to predict the water segmentation map in the image.…”
Section: Methodsmentioning
confidence: 99%
“…Ding et al [19] propose RoI Transformer, which introduces a rotational RoI Learner for RPN that significantly reduces the number of preset RoIs. Wang et al [24] take an instance-segmentation-based method to generate oriented bounding boxes from instance masks called Mask OBB. Han et al [20] address the severe misalignment between oriented anchor boxes and axis-aligned convolutional features by proposing a Feature Alignment Module for anchor generation.…”
Section: A Object Detection In Aerial Imagesmentioning
confidence: 99%
See 1 more Smart Citation
“…Therefore, the HBB-based model may cause severe overlap and noise. In subsequent work, an oriented bounding box (OBB) was used to process rotating remote sensing targets [32][33][34][35][36][37][38][39][40], enabling more accurate target capture and introducing considerably less background noise. Feng et al [32] proposed a robust Student's t-distribution-aided one-stage orientation detector.…”
Section: Introductionmentioning
confidence: 99%
“…One is box-of-free feature extraction [13][14][15], in which target detection is accomplished by embedding a cosine function or embedding a class of clusters in pixels. The other is based on frame-based feature extraction, but this method of embedding clusters has two major disadvantages in the extraction process [16]. One is that the global information of the picture cannot be fully considered, and the other is that the embedded information is mainly a cosine function, so there are many restrictions before embedding, and this method must be limited in the use process.…”
Section: Introductionmentioning
confidence: 99%