U-ASD Net: Supervised Crowd Counting Based on Semantic Segmentation and Adaptive Scenario Discovery

Hafeezallah, Adel; Al-Dhamari, Ahlam; Abu-Bakar, S. A. R.

doi:10.1109/access.2021.3112174

Cited by 15 publications

(6 citation statements)

References 64 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Moreover, for PCC Net [ 25 ], its parameter space is as light as the Cascaded-MTL model, but its runtime is even higher due to the inferior running platform. As shown in Table 9 , the parameter number of the proposed CL-DCNN is one-third of the U-ASD network [ 12 ] and with a lower runtime. After considering the running platform and parameter quantity comprehensively, the effectiveness of the proposed CL-DCNN and U-ASD network is equivalent.…”

Section: Methodsmentioning

confidence: 99%

“…Furthermore, by integrating the U-net architecture with an Adaptive Scenario Discovery (ASD) module, Hafeezallah et al proposed the U-ASD-Net [ 12 ] for crowd counting. Specifically, the U-ASD-Net employs a max-unpooling layer to upsample feature maps based on maximum locations, thereby replacing the nearest upsampling method in the U-part.…”

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

A Dilated Convolutional Neural Network for Cross-Layers of Contextual Information for Congested Crowd Counting

Zhao,

Ma,

Jia

et al. 2024

Sensors

View full text Add to dashboard Cite

Crowd counting is an important task that serves as a preprocessing step in many applications. Despite obvious improvement reported by various convolutional-neural-network-based approaches, they only focus on the role of deep feature maps while neglecting the importance of shallow features for crowd counting. In order to surmount this issue, a dilated convolutional-neural-network-based cross-level contextual information extraction network is proposed in this work, which is abbreviated as CL-DCNN. Specifically, a dilated contextual module (DCM) is constructed by importing cross-level connection between different feature maps. It can effectively integrate contextual information while conserving the local details of crowd scenes. Extensive experiments show that the proposed approach outperforms state-of-the-art approaches using five public datasets, i.e., ShanghaiTech part A, ShanghaiTech part B, Mall, UCF_CC_50 and UCF-QNRF, achieving MAE 52.6, 8.1, 1.55, 181.8, and 96.4, respectively.

show abstract

Section: Methodsmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

A Dilated Convolutional Neural Network for Cross-Layers of Contextual Information for Congested Crowd Counting

Zhao,

Ma,

Jia

et al. 2024

Sensors

View full text Add to dashboard Cite

show abstract

“…6 compares the results of MAE with MSE. [5] 467.0 498.5 MCNN [15] 377.6 509.1 Hydra 2's [17] 333.7 425.2 Hydra 3's [17] 465.7 371.8 Learning to count [71] 364.4 341.4 Fully convolutional [45] 338.6 424.5 Cascaded-MTL [69] 322.8 397.9 Switching-CNN [4] 318.1 439.2 CP-CNN [13] 295.8 320.9 CSRNet [2] 266.1 397.5 Transform dilated [72] 250.1 342.1 U-ASD Net [73] 232.3 217.8 S-DCNet [51] 204.2 301.3 CSCC-Net (Ours) 199.1 243.2…”

Section: Ucf_cc_50 Data Setmentioning

confidence: 99%

A Deep Learning Approach for Crowd Counting in Highly Congested Scene

Khan¹,

Kadir²,

Shah³

et al. 2022

Computers, Materials &Amp; Continua

View full text Add to dashboard Cite

With the rapid progress of deep convolutional neural networks, several applications of crowd counting have been proposed and explored in the literature. In congested scene monitoring, a variety of crowd density estimating approaches has been developed. The understanding of highly congested scenes for crowd counting during Muslim gatherings of Hajj and Umrah is a challenging task, as a large number of individuals stand nearby and, it is hard for detection techniques to recognize them, as the crowd can vary from low density to high density. To deal with such highly congested scenes, we have proposed the Congested Scene Crowd Counting Network (CSCC-Net) using VGG-16 as a core network with its first ten layers due to its strong and robust transfer learning rate. A hole dilated convolutional neural network is used at the back end to widen the relevant field to extract a large range of information from the image without losing its original resolution. The dilated convolution neural network is mainly chosen to expand the kernel size without changing other parameters. Moreover, several loss functions have been applied to strengthen the evaluation accuracy of the model. Finally, the entire experiments have been evaluated using prominent data sets namely, ShanghaiTech parts A, B, UCF_CC_50, and UCF_QNRF. Our model has achieved remarkable results i.e., 68.0 and 9.0 MAE on ShanghaiTech parts A, B, 199.1 MAE on UCF_CC_50, and 99.8 on UCF_QNRF data sets respectively.

show abstract

“…Crowd behaviour analysis [3], [4] examines the movement and interactions within crowds to improve safety. Crowd density estimation and crowd counting [5] are focused on assessing the number of people and the compactness of a crowd, which have applications in public safety and event management. Crowd anomaly detection [6] identifies unusual patterns that may indicate danger or suspicious activities, while group detection [7] explores the formation and behavior of smaller groups within the crowd.…”

Section: Introductionmentioning

confidence: 99%

Motion Pattern-Based Scene Classification Using Adaptive Synthetic Oversampling and Fully Connected Deep Neural Network

Mohammed,

Al-Dhamari,

Saeed

et al. 2023

IEEE Access

Self Cite

View full text Add to dashboard Cite

Analyzing crowded environments has become an increasingly researched topic in computer vision community, largely due to its myriad practical applications, including enhanced video surveillance systems and the estimation of crowd density in specific settings. This paper presents a comprehensive method for progressing the study of crowd dynamics and behavioral analysis, specifically focusing on the classification of movement patterns. We introduce a specialized neural network-based classifier explicitly designed for the accurate categorization of various crowd scenes. This classifier fills a unique niche in the existing literature by offering robust and adaptive classification capabilities. To optimize the performance of our model, we conduct an in-depth analysis of loss functions commonly employed in multi-class classification tasks. Our study encompasses four widely used loss functions: Focal Loss, Huber Loss, Cross-Entropy Loss, and Multi-Margin Loss. Based on empirical findings, we introduce a Joint Loss function that combines the strengths of Cross-Entropy and Multi-Margin Loss, outperforming existing methods across key performance metrics such as accuracy, precision, recall, and F1-score. Furthermore, we address the critical challenge of class imbalance in motion patterns within crowd scenes. To this end, we perform a comprehensive comparative study of two leading oversampling techniques: the synthetic minority oversampling technique (SMOTE) and adaptive synthetic sampling (ADASYN). Our results indicate that ADASYN is superior at enhancing classification performance. This approach not only mitigates the issue of class imbalance but also provides robust empirical validation for our proposed method. Finally, we subject our model to a rigorous evaluation using the Collective Motion Database, facilitating a comprehensive comparison with existing state-of-the-art techniques. This evaluation confirms the effectiveness of our model and aligns it with established paradigms in the field.

show abstract

U-ASD Net: Supervised Crowd Counting Based on Semantic Segmentation and Adaptive Scenario Discovery

Cited by 15 publications

References 64 publications

A Dilated Convolutional Neural Network for Cross-Layers of Contextual Information for Congested Crowd Counting

A Dilated Convolutional Neural Network for Cross-Layers of Contextual Information for Congested Crowd Counting

A Deep Learning Approach for Crowd Counting in Highly Congested Scene

Motion Pattern-Based Scene Classification Using Adaptive Synthetic Oversampling and Fully Connected Deep Neural Network

Contact Info

Product

Resources

About