2022
DOI: 10.1109/tpami.2020.3046647
|View full text |Cite
|
Sign up to set email alerts
|

Weakly Supervised Object Detection Using Proposal- and Semantic-Level Relationships

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
16
0
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
2

Relationship

1
8

Authors

Journals

citations
Cited by 107 publications
(17 citation statements)
references
References 55 publications
0
16
0
1
Order By: Relevance
“…Many studies combine CAMs [ 37 ] or Weakly Supervised Semantic Segmentation (WSSS) [ 38 ] to achieve better WSOD performances. The authors of [ 39 , 40 , 41 ] leverage the power of CAMs as segmentation proposals, [ 42 , 43 , 44 , 45 ] introduce a collaboration loop between the segmentation and detection branches, [ 46 ] proposes a cascaded convolutional neural network and [ 47 ] exploit segmentation properties, i.e., purity and completeness, to harvest tight boxes that take into account the surrounding context. Still, the actual methods cannot fully exploit CAMs as bounding box generators and require the use of external domain-dependent proposals or hybrid-annotated data.…”
Section: Related Workmentioning
confidence: 99%
“…Many studies combine CAMs [ 37 ] or Weakly Supervised Semantic Segmentation (WSSS) [ 38 ] to achieve better WSOD performances. The authors of [ 39 , 40 , 41 ] leverage the power of CAMs as segmentation proposals, [ 42 , 43 , 44 , 45 ] introduce a collaboration loop between the segmentation and detection branches, [ 46 ] proposes a cascaded convolutional neural network and [ 47 ] exploit segmentation properties, i.e., purity and completeness, to harvest tight boxes that take into account the surrounding context. Still, the actual methods cannot fully exploit CAMs as bounding box generators and require the use of external domain-dependent proposals or hybrid-annotated data.…”
Section: Related Workmentioning
confidence: 99%
“…OIM [23] 尝试通过在空间图和外观图上引入信息, 引入空间局部上下文关系检测所有实例, 但没有对全局上下文关系和实例信息深度挖掘, 这限制性能进一步提高. WSOD2 [19] 利用自底向上的 方法蒸馏目标区域, PSLR [22] 利用图卷积神经网络挖掘局部上下文和全局上下文信息, 提高检测到完 整目标的概率. 但忽略最高得分框包含多个实例的情况, 在定位精度上明显低于本文算法.…”
Section: 对比实验unclassified
“…但忽略最高得分框包含多个实例的情况, 在定位精度上明显低于本文算法. C-WSL [78] , PCL [41] , C-MIL [43] , W2F [39] , wetectron [ 3 Comparison of the performance of the proposed approach and state-of-the-art methods on the VOC 2012 set OICR [16] PCL [41] MELM [17] W2F [39] C-WSL [78] C-MIL [43] C-MIDN [40] OIM+IR [23] PSLR [22] [19] C-MIDN* OIM+IR* wetectron [79]…”
Section: 对比实验mentioning
confidence: 99%
“…Despite good performances on coarse-grained datasets (e.g., Animal with Attribute dataset [3]), these approaches gradually degenerate when dealing with fine-grained datasets (e.g., Caltech-UCSD Birds-200-2011 dataset [8]), since much more local discriminative information is required to distinguish these fine-grained classes. Several recent works [9], [10], [11], [12], [13] try to focus on discriminative visual feature learning, by introducing attention mechanism into zero-shot classification problem, such as the spatial and channel attention [9], region attention [10], [14]. However, there still exists significant semantic gap between the visual modality and the attribute modality in the existing passive attention mechanisms as these methods generate attention weights purely in the bottom-up forward passing manner, which lacks the necessary top-down guidance and semantic alignment for attending to the real attribute-correlation regions.…”
Section: Introductionmentioning
confidence: 99%