2023
DOI: 10.1109/access.2023.3267409
|View full text |Cite
|
Sign up to set email alerts
|

Violence Detection Enhancement by Involving Convolutional Block Attention Modules Into Various Deep Learning Architectures: Comprehensive Case Study for UBI-Fights Dataset

Abstract: The violence detection in surveillance videos is a complicated task, due to the requirements of extracting the spatio-temporal features in different videos environment, and various videos prospective cases. Hereby, in this paper, different architectures are proposed to perform this task in high performance, by using the UBI-Fights dataset as a comprehensive case study. The proposed architectures are based on involving the Convolutional Block Attention Modules (CBAM) with other simple layers (e.g., ConvLSTM2D o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 29 publications
0
4
0
Order By: Relevance
“…Finally, a dense layer with 10 units (i.e., one output per each driver case class) and a softmax activation function, are added to extract the final probability per each class. The hyper-parameters for the proposed architecture are adjusted based on the main author's experience of handling such cases over the course of many papers [28][29][30][31].…”
Section: Overall Proposed Architecture Layoutmentioning
confidence: 99%
“…Finally, a dense layer with 10 units (i.e., one output per each driver case class) and a softmax activation function, are added to extract the final probability per each class. The hyper-parameters for the proposed architecture are adjusted based on the main author's experience of handling such cases over the course of many papers [28][29][30][31].…”
Section: Overall Proposed Architecture Layoutmentioning
confidence: 99%
“…While there are datasets like UT-Interaction, UCF101 and the Kinetics action recognition dataset that include examples of actions like boxing and punching, these datasets feature lowerresolution inputs and do not align with the requirement of video feeds capturing physical abuse actions. After extensive search [33], [34], we were able to identify two publicly available datasets UBI-Fights [35] and UCF Crime [36], that have real life video clips of common people hitting, kicking, beating and such fight sequences. This was the nearest match that we could get for detecting physical abuse.…”
Section: Stage-3mentioning
confidence: 99%
“…Improving the accuracy of motion recognition using both visual and motion information is a major feature of I3D. Additionally, anomaly detection studies continue to be published [51][52][53][54][55][56][57][58][59][60].…”
Section: I3dmentioning
confidence: 99%