2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2022
DOI: 10.1109/wacv51458.2022.00339
|View full text |Cite
|
Sign up to set email alerts
|

Low-cost Multispectral Scene Analysis with Modality Distillation

Abstract: Despite its robust performance under various illumination conditions, multispectral scene analysis has not been widely deployed due to two strong practical limitations: 1) thermal cameras, especially high-resolution ones are much more expensive than conventional visible cameras; 2) the most commonly adopted multispectral architectures, twostream neural networks, nearly double the inference time of a regular mono-spectral model which makes them impractical in embedded environments. In this work, we aim to tackl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 24 publications
0
7
0
Order By: Relevance
“…In addition, KD is also employed for some multi-modal tasks, such as RGB-D salient object detection [19,20] and RGB-T pedestrian detection [15,29]. Specifically, in [20,29], the single-stream feature extractors and the early fusion strategies were employed in their student models.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…In addition, KD is also employed for some multi-modal tasks, such as RGB-D salient object detection [19,20] and RGB-T pedestrian detection [15,29]. Specifically, in [20,29], the single-stream feature extractors and the early fusion strategies were employed in their student models.…”
Section: Related Workmentioning
confidence: 99%
“…In addition, KD is also employed for some multi-modal tasks, such as RGB-D salient object detection [19,20] and RGB-T pedestrian detection [15,29]. Specifically, in [20,29], the single-stream feature extractors and the early fusion strategies were employed in their student models. However, both of them only simply employ the distillation loss functions to improve the performance of student models by using fused features or label knowledges of teacher models, and pay less attention to the huge differences between the teacher model and the student model in the unimodal feature extraction stage as well as in the multi-modal feature fusion stage.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Therefore, this work focuses on improving the effectiveness of pedestrian detectors in low-light conditions. Combining multispectral images (visible and infrared images) has proven useful for robust pedestrian detection in ADAS applications [6][7][8][9][10][11][12][13][14][15][16][17][18][19][20][21]. The signals of visible and infrared images originate from different modes and can provide scene information from different aspects.…”
Section: Introductionmentioning
confidence: 99%