2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.00479
|View full text |Cite
|
Sign up to set email alerts
|

Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting

Abstract: Crowd counting is a fundamental yet challenging problem, which desires rich information to generate pixel-wise crowd density maps. However, most previous methods only utilized the limited information of RGB images and may fail to discover the potential pedestrians in unconstrained environments. In this work, we find that incorporating optical and thermal information can greatly help to recognize pedestrians. To promote future researches in this field, we introduce a large-scale RGBT Crowd Counting (RGBT-CC) be… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
75
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 110 publications
(76 citation statements)
references
References 65 publications
0
75
1
Order By: Relevance
“…Recently, auxiliary task learning based counting methods [44], [45], [46], [47], [48], [49], [50], [51], [9], [52], [53] attracted researchers' attention because of its ability to capture extra granularity information and contextual dependencies for the density map regression. Most of the methods utilized the potential of a model itself with auxiliary tasks, such as object detection, crowd segmentation, density level classification, etc., to enhance the feature tuning for density map regression.…”
Section: B Auxiliary Tasks Based Countingmentioning
confidence: 99%
“…Recently, auxiliary task learning based counting methods [44], [45], [46], [47], [48], [49], [50], [51], [9], [52], [53] attracted researchers' attention because of its ability to capture extra granularity information and contextual dependencies for the density map regression. Most of the methods utilized the potential of a model itself with auxiliary tasks, such as object detection, crowd segmentation, density level classification, etc., to enhance the feature tuning for density map regression.…”
Section: B Auxiliary Tasks Based Countingmentioning
confidence: 99%
“…We compared the proposed method with several state-ofthe-art methods, including CSRNet [9], BL [7], CMCRL [3]. As shown in Table I, our method outperforms any other SOTA methods.…”
Section: Comparison With State-of-the-art Methodsmentioning
confidence: 99%
“…As shown in Fig. 3, after we get F i−T , F i−RGB and F i−C from i th stage, we apply pyramid pooling layer to extract contextual information CI i−T , CI i−RGB and CI i−C respectively, as [3] did. By using contextual information can we avoid excessive fusing of features and mitigate the misalignment in RGBT-CC dataset.…”
Section: B Information Improvement Modulementioning
confidence: 99%
See 2 more Smart Citations