2019
DOI: 10.1007/978-3-030-11012-3_24
|View full text |Cite
|
Sign up to set email alerts
|

Bidirectional Convolutional LSTM for the Detection of Violence in Videos

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
63
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 114 publications
(70 citation statements)
references
References 27 publications
1
63
0
Order By: Relevance
“…The input, which is the concatenation of all recursive inference block outputs according to the sequence, is denoted as [ , , … , ]. ConvLSTM [49] and BiConvLSTM [50] are usually used to learn global, long-term spatiotemporal features of videos. In our approach, we considered the recursions of RIB as a temporal sequence in the order of the recursion.…”
Section: Biconvlstmmentioning
confidence: 99%
“…The input, which is the concatenation of all recursive inference block outputs according to the sequence, is denoted as [ , , … , ]. ConvLSTM [49] and BiConvLSTM [50] are usually used to learn global, long-term spatiotemporal features of videos. In our approach, we considered the recursions of RIB as a temporal sequence in the order of the recursion.…”
Section: Biconvlstmmentioning
confidence: 99%
“…Recent studies have focused on the detection of violence or fighting among multiple individuals in a crowd, with particular emphasis on violent scenes that cannot be effectively detected by the security personnel in the field. Towards this objective, the latest advances in deep learning have been exploited, whereby the temporal analysis of visual information is almost always performed using Convolutional Neural Networks (CNNs) [4], Recurrent Neural Networks (RNNs) [8], or 3D Convolutional Neural Networks (3D-CNNs) [22].…”
Section: Introductionmentioning
confidence: 99%
“…For many applications, utilising temporal consistency has been demonstrated to improve performance. Examples include depth estimation [39], motion segmentation [4], action recognition [22], super resolution [17] and superpixel segmentation [33]. Single image approaches for horizon line estimation may do gross mistakes when the image provides few or misleading clues.…”
Section: Introductionmentioning
confidence: 99%