2020
DOI: 10.3390/app10217834
|View full text |Cite
|
Sign up to set email alerts
|

A Review of Video Object Detection: Datasets, Metrics and Methods

Abstract: Although there are well established object detection methods based on static images, their application to video data on a frame by frame basis faces two shortcomings: (i) lack of computational efficiency due to redundancy across image frames or by not using a temporal and spatial correlation of features across image frames, and (ii) lack of robustness to real-world conditions such as motion blur and occlusion. Since the introduction of the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) in 2015, a g… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
29
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
2

Relationship

1
9

Authors

Journals

citations
Cited by 88 publications
(30 citation statements)
references
References 98 publications
0
29
0
1
Order By: Relevance
“…Such efforts are valuable especially for replacing network components by more compute-friendly counterparts. However, advantages of such techniques can also be limited due to natural trade-offs between speed and performance [50] as the lower-cost network components tend to have lower expressive power. Nevertheless, one can easily incorporate low-cost model design principles into DEQ or StreamDEQ models, thanks to the architecture-agnostic definition of implicit layer models.…”
Section: Efficient Video Processing and Inferencementioning
confidence: 99%
“…Such efforts are valuable especially for replacing network components by more compute-friendly counterparts. However, advantages of such techniques can also be limited due to natural trade-offs between speed and performance [50] as the lower-cost network components tend to have lower expressive power. Nevertheless, one can easily incorporate low-cost model design principles into DEQ or StreamDEQ models, thanks to the architecture-agnostic definition of implicit layer models.…”
Section: Efficient Video Processing and Inferencementioning
confidence: 99%
“…These days, object recognition from images and videos captured by digital cameras is being preferred by people for automated tasks related to security monitoring, public safety [2], pedestrian behavior analysis, etc. Different approaches for video object detection based on deep learning were studied in [3]. Pattern classification from images was also carried out in [4][5][6].…”
Section: Introductionmentioning
confidence: 99%
“…The simplest form of clustering is called partitional clustering, which is mainly to divide a given data set into disjoint clusters. In the partitional clustering problem, each cluster with approximate similar points is a very common scenario, such as image segmentation [8] and object tracking [9] and movement detection [10], [11]. In order to solve this problem, many clustering algorithms have been proposed.…”
Section: Introductionmentioning
confidence: 99%