Proceedings of the 2023 ACM International Conference on Multimedia Retrieval 2023
DOI: 10.1145/3591106.3592267
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Label Meta Weighting for Long-Tailed Dynamic Scene Graph Generation

Abstract: This paper investigates the problem of scene graph generation in videos with the aim of capturing semantic relations between subjects and objects in the form of ⟨subject, predicate, object⟩ triplets. Recognizing the predicate between subject and object pairs is imbalanced and multi-label in nature, ranging from ubiquitous interactions such as spatial relationships (e.g. in front of ) to rare interactions such as twisting. In widely-used benchmarks such as Action Genome and VidOR, the imbalance ratio between th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 36 publications
(84 reference statements)
0
0
0
Order By: Relevance