2021
DOI: 10.3390/app112411591
|View full text |Cite
|
Sign up to set email alerts
|

Vision Transformer-Based Tailing Detection in Videos

Abstract: Tailing is defined as an event where a suspicious person follows someone closely. We define the problem of tailing detection from videos as an anomaly detection problem, where the goal is to find abnormalities in the walking pattern of the pedestrians (victim and follower). We, therefore, propose a modified Time-Series Vision Transformer (TSViT), a method for anomaly detection in video, specifically for tailing detection with a small dataset. We introduce an effective way to train TSViT with a small dataset by… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 40 publications
0
1
0
Order By: Relevance
“… Wang et al (2022) proposed variable Transformer to perform hierarchical feature learning of spatial information from electrodes to brain regions to capture spatial information of EEG signals and improve the accuracy of emotion classification tasks ( Wang et al, 2022 ). However, the research on MI-EEG signal recognition is still insufficient ( Lee et al, 2021 ; Ormerod et al, 2021 ; Singh and Mahmood, 2021 ; Zhu et al, 2021 ). The self-attention mechanism for global feature interactions between feature channels is a very effective method for feature extraction, and it has great potential for processing EEG signals because it can capture the global information of the input data very effectively ( Xie et al, 2022 ).…”
Section: Introductionmentioning
confidence: 99%
“… Wang et al (2022) proposed variable Transformer to perform hierarchical feature learning of spatial information from electrodes to brain regions to capture spatial information of EEG signals and improve the accuracy of emotion classification tasks ( Wang et al, 2022 ). However, the research on MI-EEG signal recognition is still insufficient ( Lee et al, 2021 ; Ormerod et al, 2021 ; Singh and Mahmood, 2021 ; Zhu et al, 2021 ). The self-attention mechanism for global feature interactions between feature channels is a very effective method for feature extraction, and it has great potential for processing EEG signals because it can capture the global information of the input data very effectively ( Xie et al, 2022 ).…”
Section: Introductionmentioning
confidence: 99%