ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022
DOI: 10.1109/icassp43922.2022.9747376
|View full text |Cite
|
Sign up to set email alerts
|

Video Anomaly Detection via Prediction Network with Enhanced Spatio-Temporal Memory Exchange

Abstract: Video anomaly detection is a challenging task because most anomalies are scarce and non-deterministic. Many approaches investigate the reconstruction difference between normal and abnormal patterns, but neglect that anomalies do not necessarily correspond to large reconstruction errors. To address this issue, we design a Convolutional LSTM Auto-Encoder prediction framework with enhanced spatiotemporal memory exchange using bi-directionalilty and a higher-order mechanism. The bi-directional structure promotes l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 11 publications
(3 citation statements)
references
References 18 publications
0
3
0
Order By: Relevance
“…Unsupervised models exclusively employ normal videos for training, with data mapping via an encoder-decoder structure being the most frequently used strategy. The baseline approaches in this context are based on data reconstruction or prediction, where anomalous events are detected based on the reconstruction or prediction errors, respectively [12,52,24,29,49,21,38]. Adversarial losses can also be applied in such approaches, where the generator is adversarially trained to perform the data mapping so that abnormal videos lead to unrealistic outputs with large reconstruction or prediction errors [37,19,45,30,51].…”
Section: Related Workmentioning
confidence: 99%
“…Unsupervised models exclusively employ normal videos for training, with data mapping via an encoder-decoder structure being the most frequently used strategy. The baseline approaches in this context are based on data reconstruction or prediction, where anomalous events are detected based on the reconstruction or prediction errors, respectively [12,52,24,29,49,21,38]. Adversarial losses can also be applied in such approaches, where the generator is adversarially trained to perform the data mapping so that abnormal videos lead to unrealistic outputs with large reconstruction or prediction errors [37,19,45,30,51].…”
Section: Related Workmentioning
confidence: 99%
“…Shen et al. [48] designed a ConvLSTM prediction framework for AE with enhanced spatio‐temporal memory exchange using bi‐directionality and a higher‐order mechanisms. Chang et al.…”
Section: Introductionmentioning
confidence: 99%
“…Hao et al [28] proposed a spatiotemporal consistency-enhanced network to generate spatiotemporal consistency predictions. Shen et al [48] designed a ConvLSTM prediction framework for AE with enhanced spatio-temporal memory exchange using bi-directionality and a higher-order mechanisms. Chang et al [29] explore a novel convolution AE architecture that can dissociate the spatio-temporal representation to separately capture the spatial and the temporal information, and exploit a variance-based attention module and insert it into the motion AE to highlight large movement areas.…”
Section: Introductionmentioning
confidence: 99%