2020
DOI: 10.1007/s00371-020-01878-6
|View full text |Cite
|
Sign up to set email alerts
|

Multi-frame feature-fusion-based model for violence detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
26
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 52 publications
(26 citation statements)
references
References 42 publications
0
26
0
Order By: Relevance
“…Li et al [36] fuse attention based spatial RGB stream with commonly used temporal and spatial streams to propose a multi-mode fusion method to detect violence. Asad et al [37] detect violence by combining a CNN and a wide-dense residual block to learn spatial features and LSTM units to learn temporal features. Ullah et al [38] use a lightweight CNN to identify frames containing persons and pass those frames to a 3D CNN for detection of abnormalities.…”
Section: B Deep Learning-based Methodsmentioning
confidence: 99%
“…Li et al [36] fuse attention based spatial RGB stream with commonly used temporal and spatial streams to propose a multi-mode fusion method to detect violence. Asad et al [37] detect violence by combining a CNN and a wide-dense residual block to learn spatial features and LSTM units to learn temporal features. Ullah et al [38] use a lightweight CNN to identify frames containing persons and pass those frames to a 3D CNN for detection of abnormalities.…”
Section: B Deep Learning-based Methodsmentioning
confidence: 99%
“…An LSTM network finally learned and classified the violent activity patterns over a period. Asad et al [13] adopted a multi-level feature fusion approach to integrate local motion patterns from an equally spaced sequence of input frames. They combined a wide-dense residual block with a 2D-CNN to learn combined features obtained from pairs of input frames.…”
Section: Deep Learning-based Methodsmentioning
confidence: 99%
“…So, for example, the acts of balancing attempts made by a person falling have much in common with the patterns commonly found in suspicious and violent behaviors. Therefore, the solutions targeting this problem mostly aim at providing inclusive methods for detecting multiple anomalies [3,10,11], customizing datasets to learn specific features of targeted behaviors [10], and using advanced techniques of learning the motion patterns [12][13][14], often by incorporating both spatial and temporal features. The second difficulty involves the computational complexity of behavior representation and detection algorithms, resulting in the high expense of computing resources, thus impeding their utilization in many real-world scenarios.…”
Section: Introductionmentioning
confidence: 99%
“…However, the application of this approach of abnormal behavior detection is limited, which is not conducive to its popularization. Xia et al [14][15][16][17] all used deep learning to detect abnormal actions of the human body and achieved good results with strong robustness to noises in the environment. However, due to the large number of parameters in the above model, it does not consider the problem of real-time performance in small devices.…”
Section: Introductionmentioning
confidence: 99%