2017 IEEE International Conference on Computer Vision (ICCV) 2017
DOI: 10.1109/iccv.2017.396
|View full text |Cite
|
Sign up to set email alerts
|

FCN-rLSTM: Deep Spatio-Temporal Neural Networks for Vehicle Counting in City Cameras

Abstract: In this paper, we develop deep spatio-temporal neural networks to sequentially count vehicles from low quality videos captured by city cameras (citycams). Citycam videos have low resolution, low frame rate, high occlusion and large perspective, making most existing methods lose their efficacy. To overcome limitations of existing methods and incorporate the temporal information of traffic video, we design a novel FCN-rLSTM network to jointly estimate vehicle density and vehicle count by connecting fully convolu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
87
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 213 publications
(91 citation statements)
references
References 41 publications
0
87
0
Order By: Relevance
“…ing estimating the number of vehicles in traffic congestion [29,30,56,14,28], counting the cells and bacteria from microscopic images [20,42,44,45,8], and animal crowd estimations for ecological survey [27,1,18], to name a few.…”
Section: Introductionmentioning
confidence: 99%
“…ing estimating the number of vehicles in traffic congestion [29,30,56,14,28], counting the cells and bacteria from microscopic images [20,42,44,45,8], and animal crowd estimations for ecological survey [27,1,18], to name a few.…”
Section: Introductionmentioning
confidence: 99%
“…Computer vision-based crowd counting [8,17,26,27,36,44,48,56,68,69,74,77] has witnessed tremendous progress in the recent years. Algorithms developed for crowd counting have found a variety of applications such as video and traffic surveillance [15,21,38,59,64,71,72], agriculture monitoring (plant counting) [35], cell counting [22], scene understanding, urban planning and environmental survey [11,68].…”
Section: Introductionmentioning
confidence: 99%
“…The most relevant works to us are [32,39], which also incorporate ConvLSTM for spatial-temporal modeling. However, they are used for consecutive video frames representation and aims to estimate the crowd counting on a given surveillance image instead of forecasting crowd flow evolution based on mobility data.…”
Section: Related Workmentioning
confidence: 99%