Improving the Learning of Multi-column Convolutional Neural Network for Crowd Counting

Cheng, Zhi-Qi; Li, Jun-Xiu; Dai, Qi; Wu, Xiao; He, Jun-Yan; Hauptmann, Alexander G.

doi:10.1145/3343031.3350898

Cited by 83 publications

(24 citation statements)

References 57 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The proposed method is compared with the state-of-the-art approaches, including CG-DRCN [33], ADCrowdNet [34], DSSINet [36], Cheng et al [39], L2SM [35], etc. Table 1 shows the results of ShanghaiTech dataset.…”

Section: Results and Analysismentioning

confidence: 99%

An Enhanced Scale Robust Network for Crowd Counting

Liu

Duan

et al. 2020

IEEE Access

View full text Add to dashboard Cite

The main challenge of crowd counting is the dramatic variations in scale and perspective. Most methods combine features of different receptive fields to deal with scale problems, but do not perform well on continuous scale variations in local areas. Moreover, they use only Euclidean loss which assumes independence of each pixel without considering local correlation in the density map. Therefore, we propose the enhanced scale robust network (ESRN) for accurate and efficient crowd counting. ESRN contains an embedded GAN module, which is followed by a well-designed enhancer. The discriminator guides the generator to strengthen the local correlation and create high-quality density maps, while the enhancer combines context information in different scales and varying among different sub-regions to further enhance the scale robustness of the network. The embedded GAN module is jointly trained with the enhancer. Extensive experimental results on three challenging datasets well demonstrate the effectiveness of our method.INDEX TERMS Crowd counting, local correlation, embedded GAN module, scale robustness, enhancer.

show abstract

Section: Results and Analysismentioning

confidence: 99%

An Enhanced Scale Robust Network for Crowd Counting

Liu

Duan

et al. 2020

IEEE Access

View full text Add to dashboard Cite

show abstract

“…Sang et al [2] proposed a new model by improving the Scale-adaptive CNN (SaCNN) architecture with a backbone of fixed small receptive fields [43]. Cheng et al [3] proposed a new kind of learning strategy named Multi-column Convolutional Neural Network (McML) for multi-column networks, which could effectively solve the multi-scale learning problem of the network, and has the advantages of less parameter and be less prone to overfitting. Sindagi and Patel [5] proposed advanced counting methods which consists of multi-level and multi-directional information fusion from multi-layer networks.…”

Section: Related Work a Crowd Countingmentioning

confidence: 99%

HAGN: Hierarchical Attention Guided Network for Crowd Counting

2020

View full text Add to dashboard Cite

In recent years, deep learning based crowd counting networks have achieved significant progress. However, most of them generate rough crowd density maps due to low-resolution features used for estimating crowd distribution, which affects the performance of crowd counting. To solve this problem, in this paper, we propose a Hierarchical Attention Guided Network (HAGN) for crowd counting. We apply the first 13 layers of VGG-16 to extract base features. Then, the extracted features are processed by the Hierarchical Attention Mechanism (HAM), which guided the extracted features to enlarge step by step via our proposed attention guided branch. Finally, the outputs of HAM are fed to 1 × 1 convolutional layer for final crowd density estimation. Experiments are performed on ShanghaiTech and UCF-QNRF datasets, and our HAGN achieves promising performance compared with the other state-of-the-art methods on crowd counting and crowd localization, respectively.

show abstract

“…Recently, deep neural networks [4,6,10,32,35,41,44,58,68,71,72,76] have become mainstream in the task of crowd counting and have made remarkable progress. To acquire better performance, most of the state-of-the-art methods [13,28,31,36,40,62,66] utilized heavy backbone networks (such as the VGG model [56]) to extract features.…”

Section: Introductionmentioning

confidence: 99%

Efficient Crowd Counting via Structured Knowledge Transfer

Liu

Chen

et al. 2020

Proceedings of the 28th ACM International Conference on Multimedia

View full text Add to dashboard Cite

Crowd counting is an application-oriented task and its inference efficiency is crucial for real-world applications. However, most previous works relied on heavy backbone networks and required prohibitive run-time consumption, which would seriously restrict their deployment scopes and cause poor scalability. To liberate these crowd counting models, we propose a novel Structured Knowledge Transfer (SKT) framework, which fully exploits the structured knowledge of a well-trained teacher network to generate a lightweight but still highly effective student network. Specifically, it is integrated with two complementary transfer modules, including an Intra-Layer Pattern Transfer which sequentially distills the knowledge embedded in layer-wise features of the teacher network to guide feature learning of the student network and an Inter-Layer Relation Transfer which densely distills the cross-layer correlation knowledge of the teacher to regularize the student's feature evolution. Consequently, our student network can derive the layer-wise and cross-layer knowledge from the teacher network to learn compact yet effective features. Extensive evaluations on three benchmarks well demonstrate the effectiveness of our SKT for extensive crowd counting models. In particular, only using around 6% of the parameters and computation cost of original models, our distilled VGG-based models obtain at least 6.5× speed-up on an Nvidia 1080 GPU and even achieve state-of-the-art performance. Our code and models are available at https://github.com/HCPLab-SYSU/SKT.

show abstract

Improving the Learning of Multi-column Convolutional Neural Network for Crowd Counting

Cited by 83 publications

References 57 publications

An Enhanced Scale Robust Network for Crowd Counting

An Enhanced Scale Robust Network for Crowd Counting

HAGN: Hierarchical Attention Guided Network for Crowd Counting

Efficient Crowd Counting via Structured Knowledge Transfer

Contact Info

Product

Resources

About