2022
DOI: 10.1088/1742-6596/2171/1/012020
|View full text |Cite
|
Sign up to set email alerts
|

Spatio-Temporal Transformer for Online Video Understanding

Abstract: Leading methods in the field of online video understanding try to extract useful information from the spatial and temporal dimensions of an input video. But they are suffering from two problems: (1) These methods can only extract local video information, and cannot relate to the important features of the temporal context in the video. (2) Although some methods can quickly process the information of each frame in the video, the processing efficiency of the whole video is not good, so this type of method cannot … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 9 publications
0
2
0
Order By: Relevance
“…Because cross-validation is necessary to obtain reliable and more accurate results. In each cycle, test and training data are randomly divided and this is done 5 times [20].…”
Section: Resultsmentioning
confidence: 99%
“…Because cross-validation is necessary to obtain reliable and more accurate results. In each cycle, test and training data are randomly divided and this is done 5 times [20].…”
Section: Resultsmentioning
confidence: 99%
“…The attention-based model transformer is proved to have better performance than Long Short Term Memory (LSTM). Du et al (2022) emphasized the Spatio-temporal transformation of video understanding [12].…”
Section: Introductionmentioning
confidence: 99%