2020
DOI: 10.48550/arxiv.2009.14639
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Dissected 3D CNNs: Temporal Skip Connections for Efficient Online Video Processing

Abstract: Convolutional Neural Networks with 3D kernels (3D CNNs) currently achieve state-of-the-art results in video recognition tasks due to their supremacy in extracting spatiotemporal features within video frames. There have been many successful 3D CNN architectures surpassing the state-of-the-art results successively. However, nearly all of them are designed to operate offline creating several serious handicaps during online operation. Firstly, conventional 3D CNNs are not dynamic since their output features repres… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 50 publications
0
1
0
Order By: Relevance
“…While some specialty architectures have been devised to let 3D convolutional network variants make predictions step by step (Singh & Cuzzolin, 2019;Köpüklü et al, 2020), and accordingly also qualify as CINs, these were not weight-compatible with regular 3D CNNs.…”
Section: Continual Inference Networkmentioning
confidence: 99%
“…While some specialty architectures have been devised to let 3D convolutional network variants make predictions step by step (Singh & Cuzzolin, 2019;Köpüklü et al, 2020), and accordingly also qualify as CINs, these were not weight-compatible with regular 3D CNNs.…”
Section: Continual Inference Networkmentioning
confidence: 99%