2021
DOI: 10.1049/ipr2.12258
|View full text |Cite
|
Sign up to set email alerts
|

Anomaly detection in video sequences: A benchmark and computational model

Abstract: Anomaly detection has attracted considerable search attention. However, existing anomaly detection databases encounter two major problems. Firstly, they are limited in scale. Secondly, training sets contain only video-level labels indicating the existence of an abnormal event during the full video while lacking annotations of precise time durations. To tackle these problems, we contribute a new Large-scale Anomaly Detection (LAD) database as the benchmark for anomaly detection in video sequences, which is feat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
15
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 42 publications
(15 citation statements)
references
References 55 publications
0
15
0
Order By: Relevance
“…To this end, we combine a global temporal contrastive loss with a pair-wise video clip order prediction task. We show that our proposed approach outperforms all previously proposed self-supervised models for unintentional action prediction and achieves state-of-the-art results on two datasets of unintentional actions: the Oops dataset [9] and LAD2000 [32].…”
Section: Introductionmentioning
confidence: 76%
See 1 more Smart Citation
“…To this end, we combine a global temporal contrastive loss with a pair-wise video clip order prediction task. We show that our proposed approach outperforms all previously proposed self-supervised models for unintentional action prediction and achieves state-of-the-art results on two datasets of unintentional actions: the Oops dataset [9] and LAD2000 [32].…”
Section: Introductionmentioning
confidence: 76%
“…We conducted our experiments on two datasets with unintentional or anomalous videos: Oops [9] and LAD2000 [32]. We present the results on the LAD2000 dataset in the supplementary material.…”
Section: Datasetsmentioning
confidence: 99%
“…Wan et al [31] Wu et al [34] proposed a dual branch network using multi-detail concepts in both the temporal and spatial dimensions as input. C3D was used to extract the features, and namely spatio-temporal (ST-UCFcrime) was used to implement them on a new dataset (ST-UCF-crime) that annotates spatial-temporal bounding boxes for unusual occurrences in UCF crime.…”
Section: Related Workmentioning
confidence: 99%
“…Outdoor fall detection needs to be feasible in non-static environments with changing backgrounds, light conditions, density and clutter. This necessitates more robust approaches combining video motion detection and temporal features using three dimensional LSTM and ConvLSTM [ 12 , 40 ], extracting depth from single images using Markov Random Fields and human detection using particle swarm optimization [ 41 ] or body posture analysis using two-branch multi-stage CNNs [ 42 ].…”
Section: Related Workmentioning
confidence: 99%