2019
DOI: 10.1109/access.2019.2918936
|View full text |Cite
|
Sign up to set email alerts
|

An Automatic Scale-Adaptive Approach With Attention Mechanism-Based Crowd Spatial Information for Crowd Counting

Abstract: This paper proposes an automatic scale-adaptive approach with attention mechanism-based crowd spatial information addressing the crowd counting task, i.e. a novel cascaded crowd counting network. The proposed network is composed of a classification sub-network to estimate crowd scales and the main network to predict the corresponding density maps. First, the image serves as the input of the classification network and the main network. Second, according to the estimated crowd scale results, the main network str… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 12 publications
(3 citation statements)
references
References 30 publications
0
3
0
Order By: Relevance
“…Sang et al [21] optimized the geometric adaptive Gaussian kernel function of SaCNN to generate a higher quality real DM. Kong et al [22] proposed an adaptive attention mechanism method to automatically adjust the network structure through the crowd size.…”
Section: Methods Based On Cnnmentioning
confidence: 99%
See 1 more Smart Citation
“…Sang et al [21] optimized the geometric adaptive Gaussian kernel function of SaCNN to generate a higher quality real DM. Kong et al [22] proposed an adaptive attention mechanism method to automatically adjust the network structure through the crowd size.…”
Section: Methods Based On Cnnmentioning
confidence: 99%
“…e performance of the detector in dense scenes is improved, but this scheme is only applicable to video stream data. In dense scenes, due to the severe occlusion, Vora [2] and Kong et al [22] detected the crowd heads, which increased the accuracy of detection. Vora [2] proposed faster R-CNN directly for binary classification tasks, to determine whether the detection frame is a human head and to reduce the number of anchor boxes according to the human head scale, speeding up the detection process.…”
Section: Methods Based On Cnnmentioning
confidence: 99%
“…However, due to heavy occlusions, scale and perspective variations, and irregular cluster, it is extremely difficult to generate accurate density value for each pixel of the input crowd image. Inspired by the successful use of convolution neural network (CNN) in image recognition and image segmentation tasks [ [11]. These methods tend to utilize CNNs with different size of receptive fields to obtain scale aggregation features for adapting the large variation in crowd density.…”
Section: Introductionmentioning
confidence: 99%