2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021
DOI: 10.1109/iccv48922.2021.00747
|View full text |Cite
|
Sign up to set email alerts
|

OadTR: Online Action Detection with Transformers

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
60
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 93 publications
(60 citation statements)
references
References 33 publications
0
60
0
Order By: Relevance
“…Earlier NLP [2], [47] and image-based works [7] introduced the use of GeLU [75] (instead of ReLU) as the activation function for the hidden layer in the FF sub-layer. This trend has been followed by a number of video works [9], [49], [64], [67], [71], [76] (see Tab. 1).…”
Section: Activation In Ffnmentioning
confidence: 99%
See 4 more Smart Citations
“…Earlier NLP [2], [47] and image-based works [7] introduced the use of GeLU [75] (instead of ReLU) as the activation function for the hidden layer in the FF sub-layer. This trend has been followed by a number of video works [9], [49], [64], [67], [71], [76] (see Tab. 1).…”
Section: Activation In Ffnmentioning
confidence: 99%
“…Some tasks focusing on frame-level predictions (such as video summarization [40]) may not require finer patch-level based granularity. Many VTs leverage frames as tokens (e.g., [14], [57], [59], [66], [67], [69], [93], [114]), achieving a good balance between computational cost and performance.…”
Section: Tokenizationmentioning
confidence: 99%
See 3 more Smart Citations