2024
DOI: 10.1007/s00530-023-01254-z
|View full text |Cite
|
Sign up to set email alerts
|

You watch once more: a more effective CNN architecture for video spatio-temporal action localization

Yefeng Qin,
Lei Chen,
Xianye Ben
et al.

Abstract: The task of spatio-temporal action localization (STAL) needs to detect the action and position of individuals in the scene. Many works focus on how to improve the accuracy, but they usually ignore inference speed and practical applications. To address the above problems, we propose a new end-to-end spatio-temporal action localization network called You Watch Once More (YWOM). In this work, there are three measures proposed to improve the accuracy of positioning and recognition while guaranteeing the inference … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
references
References 57 publications
0
0
0
Order By: Relevance