2021
DOI: 10.3390/rs13081461
|View full text |Cite
|
Sign up to set email alerts
|

Dynamic Pseudo-Label Generation for Weakly Supervised Object Detection in Remote Sensing Images

Abstract: In recent years, fully supervised object detection methods in remote sensing images with good performance have been developed. However, this approach requires a large number of instance-level annotated samples that are relatively expensive to acquire. Therefore, weakly supervised learning using only image-level annotations has attracted much attention. Most of the weakly supervised object detection methods are based on multi-instance learning methods, and their performance depends on the process of scoring the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
19
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 24 publications
(19 citation statements)
references
References 38 publications
0
19
0
Order By: Relevance
“…Many studies combine CAMs [ 37 ] or Weakly Supervised Semantic Segmentation (WSSS) [ 38 ] to achieve better WSOD performances. The authors of [ 39 , 40 , 41 ] leverage the power of CAMs as segmentation proposals, [ 42 , 43 , 44 , 45 ] introduce a collaboration loop between the segmentation and detection branches, [ 46 ] proposes a cascaded convolutional neural network and [ 47 ] exploit segmentation properties, i.e., purity and completeness, to harvest tight boxes that take into account the surrounding context. Still, the actual methods cannot fully exploit CAMs as bounding box generators and require the use of external domain-dependent proposals or hybrid-annotated data.…”
Section: Related Workmentioning
confidence: 99%
“…Many studies combine CAMs [ 37 ] or Weakly Supervised Semantic Segmentation (WSSS) [ 38 ] to achieve better WSOD performances. The authors of [ 39 , 40 , 41 ] leverage the power of CAMs as segmentation proposals, [ 42 , 43 , 44 , 45 ] introduce a collaboration loop between the segmentation and detection branches, [ 46 ] proposes a cascaded convolutional neural network and [ 47 ] exploit segmentation properties, i.e., purity and completeness, to harvest tight boxes that take into account the surrounding context. Still, the actual methods cannot fully exploit CAMs as bounding box generators and require the use of external domain-dependent proposals or hybrid-annotated data.…”
Section: Related Workmentioning
confidence: 99%
“…Main objective of ML is, to classify the data based on models developed, make predictions for future and outcomes based on these models [12]. ML is broadly classified into prediction and classification [20]. Figure 1.…”
Section: Machine Learningmentioning
confidence: 99%
“…Focused research must be conducted on fusing complementary sensors and sensing abilities, simple manipulators must be developed and enhanced path optimization, navigation and path guidance algorithms suitable for dynamic climate conditions must be developed [15]. A great number of fully supervised object detection methods based on CNN have been proposed for better performance [20]. But these deep learning-based methods seeking large number of samples with instance level labels is laborious and time taking process for interpreting images with many targets [20].…”
Section: Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation
“…When the lower-level labels are available, many approaches are available to localize and detect object instances [ 20 ] that leverage two prominent paradigms: multiple-instance learning (MIL) [ 21 ] and class feature activation maps (CAMs) [ 22 ]. MIL is based on learning object instances from positive (one or more instances present) or negative (no instances) bins of data samples, and this approach has been used widely for weakly supervised object detection [ 23 , 24 ]. CAMs use intermediate feature maps from the classifier’s activation layers along with pseudo-label generation, and the method has gained popularity due to its versatility for object detection as well as for instance segmentation [ 25 , 26 , 27 ].…”
Section: Introductionmentioning
confidence: 99%