2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017
DOI: 10.1109/cvpr.2017.451
|View full text |Cite
|
Sign up to set email alerts
|

Learning Cross-Modal Deep Representations for Robust Pedestrian Detection

Abstract: This paper presents a novel method for detecting pedestrians under adverse illumination conditions. Our approach relies on a novel cross-modality learning framework and it is based on two main phases. First, given a multimodal dataset, a deep convolutional network is employed to learn a non-linear mapping, modeling the relations between RGB and thermal data. Then, the learned feature representations are transferred to a second deep network, which receives as input an RGB image and outputs the detection results… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
106
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 200 publications
(106 citation statements)
references
References 43 publications
0
106
0
Order By: Relevance
“…Among the sensors, thermal camera is widely used in face recognition [3,44,27], human tracking [30,46] and action recognition [59,15] for its biometric robustness. Motivated by this, multispectral pedestrian detection [24,52,17,39] has attracted massive attention and provides new opportunities for around-theclock applications, mainly due to its superiority of complementary nature between color and thermal modalities.…”
Section: Introductionmentioning
confidence: 99%
“…Among the sensors, thermal camera is widely used in face recognition [3,44,27], human tracking [30,46] and action recognition [59,15] for its biometric robustness. Motivated by this, multispectral pedestrian detection [24,52,17,39] has attracted massive attention and provides new opportunities for around-theclock applications, mainly due to its superiority of complementary nature between color and thermal modalities.…”
Section: Introductionmentioning
confidence: 99%
“…Deep learning based methods for pedestrian detection [8,9,10,205,206,207,208,209,210,211] For pedestrian detection, one of the most significant challenges is to handle occlusion [216,217,218,219,220,221,222,223,224,225,214,226]. A straightforward method is to use part-based models which learn a series of part detectors and integrate the results of part detectors to locate and classify objects.…”
Section: Pedestrian Detectionmentioning
confidence: 99%
“…The problem of RGBT tracking could be considered as an extension of visual tracking, and its goal is to estimate target states using the complementary advantages of visible spectrum (called RGB in this paper) and thermal infrared information given the initial state in the first pair of frame. It has been receiving much more attention re-cently and becoming more and more popular partly due to the following reasons: i) RGB and thermal data have strong complementary advantages and thus could overcome imaging limitations of individual source [19,31,32,22]. ii) Thermal infrared cameras are economically available in recent years [6], making RGBT data easier to access in various applications, such as object segmentation [19], person Re-ID [31] and pedestrian detection [10,32].…”
Section: Introductionmentioning
confidence: 99%
“…It has been receiving much more attention re-cently and becoming more and more popular partly due to the following reasons: i) RGB and thermal data have strong complementary advantages and thus could overcome imaging limitations of individual source [19,31,32,22]. ii) Thermal infrared cameras are economically available in recent years [6], making RGBT data easier to access in various applications, such as object segmentation [19], person Re-ID [31] and pedestrian detection [10,32]. iii) Recent RGBT tracking benchmark datasets [17,22] provide a flexible evaluation platform of various RGBT trackers.…”
Section: Introductionmentioning
confidence: 99%