2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021
DOI: 10.1109/iccv48922.2021.01561
|View full text |Cite
|
Sign up to set email alerts
|

Grounding Consistency: Distilling Spatial Common Sense for Precise Visual Relationship Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(1 citation statement)
references
References 45 publications
0
1
0
Order By: Relevance
“…We can also see the generalization ability of WSTQ and ISAAQ on DMC is slightly weaker than that on NDTF and NDMC, which may be caused by the difficulty of diagram understanding and the different data distribution between splits. For the former, explicit relations between regions like visual relation detection [30,31,33] may improve the diagram understanding. For the latter, fine grained attentions may enhance the reasoning ability to overcome the data shift [34,35].…”
Section: B Results On Ck12-qamentioning
confidence: 99%
“…We can also see the generalization ability of WSTQ and ISAAQ on DMC is slightly weaker than that on NDTF and NDMC, which may be caused by the difficulty of diagram understanding and the different data distribution between splits. For the former, explicit relations between regions like visual relation detection [30,31,33] may improve the diagram understanding. For the latter, fine grained attentions may enhance the reasoning ability to overcome the data shift [34,35].…”
Section: B Results On Ck12-qamentioning
confidence: 99%