SLV: Spatial Likelihood Voting for Weakly Supervised Object Detection

Chen, Ze; Fu, Zhihang; Jiang, Rongxin; Chen, Yaowu; Hua, Xian-Sheng

doi:10.1109/cvpr42600.2020.01301

Cited by 72 publications

(45 citation statements)

References 33 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Using low activation values in phase II refinement can cause ambiguity and may [29] has the highest mAP (54.9%), however, our method (MIC+PI+PII) has an enormous improvement with 6.7% mAP to [29]. From Table 3, Chen et al [28] have the highest mean Corloc 71.0% among all other compared methods. In comparison to [28], the proposed method has a significantly high score with 6.0% gain in mean CorLoc.…”

Section: Implementation Detailsmentioning

confidence: 84%

“…From Table 3, Chen et al [28] have the highest mean Corloc 71.0% among all other compared methods. In comparison to [28], the proposed method has a significantly high score with 6.0% gain in mean CorLoc. Table 4 and Table 5 illustrate the results by proposed and compared methods on PASCAL VOC2012 test and trainval sets in terms of mAP and CorLoc, respectively.…”

Section: Implementation Detailsmentioning

confidence: 93%

“…They trained each task with the supervision of the accompanying task. Lately, spatial likelihood voting method is proposed by Chen et al [28] to converge proposal localization for object detection with image-level supervision using multi-task learning. Recently, Zhang et al [14] proposed region-searching paradigm for WSOD with reinforcement learning approach under weak supervisions.…”

Section: Related Workmentioning

confidence: 99%

“…These methods [27], [28], [29] have achieved suitable performance, however, these approaches still have the problems of missing detection in case of occluded objects and wrong detections due to objects cluster. Since the training image is decomposed into thousands of proposals, and each approximately correct training instance is flooded with many incorrect training instances.…”

Section: Related Workmentioning

confidence: 99%

See 3 more Smart Citations

A Robust Context-Aware Proposal Refinement Method for Weakly Supervised Object Detection

Awan

Shin

2020

IEEE Access

View full text Add to dashboard Cite

Supervised object detection models require fully annotated data for training the network. However, labeling large datasets is a very time-consuming task, therefore, weakly supervised object detection (WSOD) is a substitute approach to fully supervised learning for the object detection task. Many methods have been proposed for WSOD to date, their performance is still lower than supervised approaches since WSOD is a very challenging task. The major problem with existing WSOD methods is partial object detection and false detection in an objects cluster with the same category. The majority of the methods on WSOD follow multiple instance learning approaches, which does not guarantee the completeness of detected objects. To address these issues, we propose a three-fold refinement strategy to proposals to learn complete instances. We generate class-specific localization maps by fused class activation maps obtained from fused complementary classification networks. These localization maps are used to amend the detected proposals from the instance classification branch (detection network). Deep reinforcement learning networks are proposed to learn decisive-agent and rectifying-agent based on policy gradient algorithm to further refine the proposals. The refined bounding boxes are then fed to instance classification network. The refinement operations result in learning complete objects and greatly improve detection performance. Experimental results show better detection performance by the proposed WSOD method compared to the state-of-the-art methods on PASCAL VOC2007 and VOC2012 benchmarks.INDEX TERMS Weakly supervised object detection, complementary learning, discriminative features, proposal refinement, class activation maps, reinforcement learning, and deep learning.

show abstract

Section: Implementation Detailsmentioning

confidence: 84%

Section: Implementation Detailsmentioning

confidence: 93%

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

See 2 more Smart Citations

A Robust Context-Aware Proposal Refinement Method for Weakly Supervised Object Detection

Awan

Shin

2020

IEEE Access

View full text Add to dashboard Cite

show abstract

“…Lin et al [28] proposed object instance mining algorithm that can help detect more possible objects. [13], [29] and [14] proposed to combine the MIL branch with a single or multiple online regression branch to achieve relocalization of proposals. These methods are all based on a multiple instance detection network, so it is hard to avoid the non-convex optimization problem brought by MIL.…”

Section: B Weakly Supervised Object Detectionmentioning

confidence: 99%

Efficient Weakly-Supervised Object Detection With Pseudo Annotations

et al. 2021

View full text Add to dashboard Cite

Weakly-supervised object detection (WSOD) has attracted lots of attention in recent years. However, there is still a big gap between WSOD and generic object detection. The main barriers to the efficiency of WSOD are the ineffective data augmentations and inaccurate bounding box predictions. Given only the image-level annotations, it's hard for WSOD to effectively utilize variant data augmentations and accurately regress the bounding boxes. Although a fully-supervised object detector can be trained using annotations generated from the weakly-supervised obejct detector, the performance is still severely limited due to the low quality of mined pseudo annotations. This paper proposes an efficient WSOD method with pseudo annotations (EWPA) to make better use of imperfect annotations. With the assistance of pseudo annotations, EWPA can effectively regress more accurate bounding boxes while the traditional WSOD can only locate the salient parts of an object. Furthermore, pseudo annotations can help design more complex data augmentations to drive the network learning more discriminative feature representations. Extensive experiments are conducted on PASCAL VOC 2007 and 2012 datasets and validate the effectiveness of EWPA.

show abstract