2020
DOI: 10.1007/978-3-030-58610-2_41
|View full text |Cite
|
Sign up to set email alerts
|

DRG: Dual Relation Graph for Human-Object Interaction Detection

Abstract: We tackle the challenging problem of human-object interaction (HOI) detection. Existing methods either recognize the interaction of each human-object pair in isolation or perform joint inference based on complex appearance-based features. In this paper, we leverage an abstract spatial-semantic representation to describe each human-object pair and aggregate the contextual information of the scene via a dual relation graph (one human-centric and one object-centric). Our proposed dual relation graph effectively c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
282
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 182 publications
(284 citation statements)
references
References 48 publications
2
282
0
Order By: Relevance
“…Most of the recent researches focused on two types of visual information: appearance features and the spatial relationship. For particular, Gao et al [21] proposed an humancentric attention module for learning highlight informative regions. Zhang et al [22] proposed a spatially conditioned graph neural network to compute the messages of nodes and graph features for predicting the interactions.…”
Section: Human-object Interaction Detectionmentioning
confidence: 99%
See 1 more Smart Citation
“…Most of the recent researches focused on two types of visual information: appearance features and the spatial relationship. For particular, Gao et al [21] proposed an humancentric attention module for learning highlight informative regions. Zhang et al [22] proposed a spatially conditioned graph neural network to compute the messages of nodes and graph features for predicting the interactions.…”
Section: Human-object Interaction Detectionmentioning
confidence: 99%
“…Inspired from prior work [21], we create multiple couples of binary images which represent the bounding boxes of the primary agent and the objects to exploit the spatial position relationship of them. The first image of the image combination always illustrates the location of the main target and the second is for the surrounding detected objects.…”
Section: ) Spatial Relation Modulementioning
confidence: 99%
“…In HOI detection, one must additionally predict the location of both correctly, Subject and Object boxes, i.e., each box must have an overlap larger than 50% with its corresponding ground-truth box. To date, a body of work have approached the HOI detection problem [41], [42], [43], [44], [45], [46], [47], [48], [49], [50], [51], [52], [53], [54], [55]. Several of these works do not explicitly integrate spatial information with regard to position, size, or layout of the involved human and objects (e.g., [53]), or integrate this information or part of it in a non-transparent way in the neural network (e.g., [49], [46], [50]).…”
Section: Human Object Interaction (Hoi) Detectionmentioning
confidence: 99%
“…The objective of Human Object Interaction (HOI) detection is to locate humans and objects and to recognise their interactions. Previous studies [32][33][34][35][36][37] show promising results of HOI sensing by decoupling it into the detection and classification of objects. In particular, the results of human and object detection first come from an object detector pre-trained, and then a pair of combined proposals for human objects interaction classification.…”
Section: Human Object Interactionmentioning
confidence: 99%