2021
DOI: 10.3390/mi13010072
|View full text |Cite
|
Sign up to set email alerts
|

Visual Feature Learning on Video Object and Human Action Detection: A Systematic Review

Abstract: Video object and human action detection are applied in many fields, such as video surveillance, face recognition, etc. Video object detection includes object classification and object location within the frame. Human action recognition is the detection of human actions. Usually, video detection is more challenging than image detection, since video frames are often more blurry than images. Moreover, video detection often has other difficulties, such as video defocus, motion blur, part occlusion, etc. Nowadays, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 175 publications
(230 reference statements)
0
7
0
Order By: Relevance
“…It aims to maximize the second term of the Discriminator cost function - after all, only this term can affect the Discriminator's cost function to increase it. Therefore, the following will apply to the Generator [ 8 ]: where the negative sign at the beginning has now been removed as the Generator tries by minimizing its cost function to increase that of the Discriminator, while all other sizes are as before. Because the first term of the equation depends only on the training data set, the above Generator cost function is declared as negative of the Discriminator cost function [ 22 , 23 ]: …”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…It aims to maximize the second term of the Discriminator cost function - after all, only this term can affect the Discriminator's cost function to increase it. Therefore, the following will apply to the Generator [ 8 ]: where the negative sign at the beginning has now been removed as the Generator tries by minimizing its cost function to increase that of the Discriminator, while all other sizes are as before. Because the first term of the equation depends only on the training data set, the above Generator cost function is declared as negative of the Discriminator cost function [ 22 , 23 ]: …”
Section: Methodsmentioning
confidence: 99%
“…It is continuous and differentiable and does not increase too fast. The proposed model introduces a normalization term that imposes a penalty when the norm of some of the output derivatives of Discriminator concerning its input is greater than 1 so that [ 21 ] and so, the cost functions that the two neural networks try to minimize will be [ 8 , 21 , 22 ]: …”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…As the proposed module is deployed in real-time environments, it is based on one-stage detectors. The considered state-of-the-art algorithms [ 72 ] based on one stage are You Only Look Once (YOLO) in different versions (YOLOv1, YOLOv2, YOLOv3, YOLOv4, YOLOv5) and Single-Shot MultiBox Detector (SSD). SSD has improved versions such as Deconvolutional SSD (DSSD) that includes large-scale context in object detection, Rainbow SSD (RSSD) that concatenates different feature maps using deconvolution and batch normalisation [ 73 ], and Feature-fusion SSD (FSSD) that balances semantic and positional information using bilinear interpolation to resize feature maps to the same size to be subsequently concatenated [ 74 ].…”
Section: Vision Modulementioning
confidence: 99%