2017 IEEE International Conference on Computer Vision (ICCV) 2017
DOI: 10.1109/iccv.2017.206
|View full text |Cite
|
Sign up to set email alerts
|

Generating High-Quality Crowd Density Maps Using Contextual Pyramid CNNs

Abstract: We present a novel method called Contextual Pyramid CNN (CP-CNN) for generating high-quality crowd density and count estimation by explicitly incorporating global and local contextual information of crowd images. The proposed CP-CNN consists of four modules: Global Context Estimator (GCE), Local Context Estimator (LCE), Density Map Estimator (DME) and a Fusion-CNN (F-CNN). GCE is a VGG-16 based CNN that encodes global context and it is trained to classify input images into different density classes, whereas LC… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

3
472
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 624 publications
(475 citation statements)
references
References 52 publications
3
472
0
Order By: Relevance
“…MAE MSE Idrees et al [40] 419.5 541.6 Zhang et al [37] 467.0 498.5 MCNN [17] 377.6 509.1 Switching-CNN [18] 318.1 439.2 CP-CNN [21] 295.8 320.9 DR-ResNet [38] 307.4 421.6 CSRNet [6] 266.1 397.5 ic-CNN [39] 260.9 365.5 SCAR (ours) 259.0 374.0 and cross-location evaluation) to show the capacity from different angles.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…MAE MSE Idrees et al [40] 419.5 541.6 Zhang et al [37] 467.0 498.5 MCNN [17] 377.6 509.1 Switching-CNN [18] 318.1 439.2 CP-CNN [21] 295.8 320.9 DR-ResNet [38] 307.4 421.6 CSRNet [6] 266.1 397.5 ic-CNN [39] 260.9 365.5 SCAR (ours) 259.0 374.0 and cross-location evaluation) to show the capacity from different angles.…”
Section: Methodsmentioning
confidence: 99%
“…With the development of deep learning, many CNN-based counting models [17,18,19,21,26,22,16,27] obtain good performance. Zhang et al [17] propose a multi-column FCN to encode the local features with the different kernel sizes.…”
Section: Crowd Countingmentioning
confidence: 99%
See 1 more Smart Citation
“…Detection-based methods perform well in low crowd images but could not be generalized well to high-density crowd images as detection fails miserably in such cases due to very few pixels per head or person. Density map estimation methods [22], [39], [40] Left and right graphs compare ten cases each, belonging to high dense and low dense extreme respectively for Density map [3], DenseNet [20] based direct regression and our method. As shown, other models either highly underestimate or overestimate, whereas the proposed method remains the closest to the ground truth (GT) bar in most cases.…”
Section: Introductionmentioning
confidence: 99%
“…With recent development of the convolutional neural network (CNN), researchers employ CNN to accurately estimate the crowd count from images or videos [1][2][3][4][5]. However, it is always challenging to deal with scale variations on static images, especially in diversified scenes such as different camera perspectives and irregular crowd clusters.…”
Section: Introductionmentioning
confidence: 99%