2021
DOI: 10.48550/arxiv.2103.04503
Preprint

End-to-End Human Object Interaction Detection with HOI Transformer

Abstract: We propose HOI Transformer to tackle human object interaction (HOI) detection in an end-to-end manner. Current approaches either decouple the HOI task into separate stages of object detection and interaction classification or introduce a surrogate interaction problem. In contrast, our method, named HOI Transformer, streamlines the HOI pipeline by eliminating the need for many hand-designed components. HOI Transformer reasons about the relations of objects and humans from global image context and directly predicts H…

Cited by 1 publication (1 citation statement)
References 38 publications
“…For encoding the representation network f (•) of pose sequences in our task, we adopt the Transformer encoder network [54] for its superior performance on recent vision applications [7,16,34,42,60].…”
Section: Transformer Backbone Details
Citation type: mentioning
confidence: 99%