2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021
DOI: 10.1109/iccv48922.2021.00024
|View full text |Cite
|
Sign up to set email alerts
|

Dance with Self-Attention: A New Look of Conditional Random Fields on Anomaly Detection in Videos

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 39 publications
(11 citation statements)
references
References 35 publications
0
11
0
Order By: Relevance
“…Weakly-supervised models use a small number of abnormal frames with labels for training. Based on the triplet loss [18] or multiple instance learning strategy [40,54,53,7,44,35], such models can learn to increase the inter-class distance between normal and abnormal data. Weakly-supervised models outperform unsupervised models but they require abnormal data with ground truth labels [50,7,44].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Weakly-supervised models use a small number of abnormal frames with labels for training. Based on the triplet loss [18] or multiple instance learning strategy [40,54,53,7,44,35], such models can learn to increase the inter-class distance between normal and abnormal data. Weakly-supervised models outperform unsupervised models but they require abnormal data with ground truth labels [50,7,44].…”
Section: Related Workmentioning
confidence: 99%
“…Despite the fact that the performance of VAD solutions has improved recently [9,35], online VAD solutions are still scarce [28,36]. Moreover, the existing ones are usually flawed.…”
Section: Introductionmentioning
confidence: 99%
“…To gain a more discriminative model utilizing local and global features and their short-range correlations, Purwanto et al [19] integrate a self-attention module containing conditional random fields (CRFs) with CNN. The relation-aware feature extractor extending the temporal relational network (TRN) is also included to extract multiscale CNN features.…”
Section: Weakly Supervised Learningmentioning
confidence: 99%
“…AnoGAN [9] is a typical generative method and EBGAN [10] borrows the concept of energy. Clustering is a widely used technique [11,18,19] to push the anomalies away from normal instances and minimize the distance between normal instances. To use features of neighboring frames, some proposed the Recurrent Neural Network (RNN)-based model to store key information in the model even if taking images as input.…”
Section: Introductionmentioning
confidence: 99%
“…Due its applicability in video surveillance, anomaly detection is an actively studied topic in the video domain, with many recent attempts trying to solve the problem by employing various approaches ranging from outlier detection * corresponding author: raducu.ionescu@gmail.com models [2,8,9,11,15,21,29,32,35,37,40,43,45,47,48,52,54,55,56,58,59,61,64,69,75,78,87,85,86] and weakly-supervised learning frameworks [16,53,67,70,83,88] to supervised open-set methods [1]. Despite the numerous attempts in solving the problem, video anomaly detection remains a challenging task, especially due to the fact that abnormal events are determined by the context.…”
Section: Introductionmentioning
confidence: 99%