Crowd Localization from Gaussian Mixture Scoped Knowledge and Scoped Teacher

Wang, Juncheng; Ji, Gao; Yuan, Yuan; Wang, Qi

doi:10.48550/arxiv.2206.05717

Cited by 1 publication

(1 citation statement)

References 37 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Detection-based methods. The early crowd counting methods are mainly based on object detection [30]- [35]. After the detection of individuals, the number of targets is counted to obtain the total number of people.…”

Section: A Rgb-based Crowd Countingmentioning

confidence: 99%

MAFNet: A Multi-Attention Fusion Network for RGB-T Crowd Counting

Chen¹,

Ji²,

Yuan³

et al. 2022

Preprint

Self Cite

View full text Add to dashboard Cite

RGB-Thermal (RGB-T) crowd counting is a challenging task, which uses thermal images as complementary information to RGB images to deal with the decreased performance of unimodal RGB-based methods in scenes with low-illumination or similar backgrounds. Most existing methods propose welldesigned structures for cross-modal fusion in RGB-T crowd counting. However, these methods have difficulty in encoding cross-modal contextual semantic information in RGB-T image pairs. Considering the aforementioned problem, we propose a two-stream RGB-T crowd counting network called Multi-Attention Fusion Network (MAFNet), which aims to fully capture long-range contextual information from the RGB and thermal modalities based on the attention mechanism. Specifically, in the encoder part, a Multi-Attention Fusion (MAF) module is embedded into different stages of the two modality-specific branches for cross-modal fusion at the global level. In addition, a Multi-modal Multi-scale Aggregation (MMA) regression head is introduced to make full use of the multi-scale and contextual information across modalities to generate high-quality crowd density maps. Extensive experiments on two popular datasets show that the proposed MAFNet is effective for RGB-T crowd counting and achieves the state-of-the-art performance.

show abstract

Section: A Rgb-based Crowd Countingmentioning

confidence: 99%