2020
DOI: 10.1007/978-3-030-58523-5_46
|View full text |Cite
|
Sign up to set email alerts
|

Improving Multispectral Pedestrian Detection by Addressing Modality Imbalance Problems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
102
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 148 publications
(102 citation statements)
references
References 38 publications
0
102
0
Order By: Relevance
“…We evaluated the proposed fusion method on the KAIST testing dataset in the reasonable setting and in all settings in comparison with ACF+T+THOG [ 14 ], Halfway Fusion [ 22 ], Fusion RPN+BDT [ 30 ], IAF R-CNN [ 28 ], IATDNN+IASS [ 27 ], CIAN [ 37 ], MSDS-RCNN [ 17 ], ARCNN [ 38 ], MBNet [ 39 ], and FusionCSPNet [ 18 ]. Among these detection methods, FusionCSPNet and our method were one-stage methods, and the rest were two-stage methods.…”
Section: Methodsmentioning
confidence: 99%
“…We evaluated the proposed fusion method on the KAIST testing dataset in the reasonable setting and in all settings in comparison with ACF+T+THOG [ 14 ], Halfway Fusion [ 22 ], Fusion RPN+BDT [ 30 ], IAF R-CNN [ 28 ], IATDNN+IASS [ 27 ], CIAN [ 37 ], MSDS-RCNN [ 17 ], ARCNN [ 38 ], MBNet [ 39 ], and FusionCSPNet [ 18 ]. Among these detection methods, FusionCSPNet and our method were one-stage methods, and the rest were two-stage methods.…”
Section: Methodsmentioning
confidence: 99%
“…Miss Rate (lower, better) All Day Night ACF [1] 47.32% 42.57% 56.17% Halfway Fusion [21] 25.75% 24.88% 26.59% Fusion RPN+BF [5] 18.29% 19.57% 16.27% IAF R-CNN [10] 15.73% 14.55% 18.26% IATDNN+IASS [9] 14.95% 14.67% 15.72% CIAN [7] 14.12% 14.77% 11.13% MSDS-RCNN [6] 11.34% 10.53% 12.94% AR-CNN [18] 9.34% 9.94% 8.38% MBNet [12] 8.13% 8.28% 7.86% Ours (full dataset) 8.86% 10.01% 6.77% Ours (10.26% of data) 9.32% 10.13% 7.70% [3] 39.7% 36.1% 36.8% FuseNet [14] 45.6% 41.0% 43.9% RTFNet [13] 53.2% 45.8% 54.8% Ours (full dataset) 53.6% 46.8% 53.3% Ours (17.99% of data) 51.0% 46.6% 48.9% Table 3. mIoU comparisons on TOKYO Dataset.…”
Section: Methodsmentioning
confidence: 99%
“…It turns out that the half-way feature fusion outperforms early or late fusion. Moreover, [7,8] apply attention mechanisms to learn an automatic re-weighting of visible and thermal features in the fusion module; [9,10] utilize illumination information as a guidance for the adaptive fusion of both features; [11,12] alleviate the inconsistency between visible and thermal features to facilitate the optimization of a dual-modality network.…”
Section: Related Workmentioning
confidence: 99%
“…The research focus has then shifted to feature-level fusion: [18,13] studied the optimal fusion "timing" in the detection network and came to the same conclusion that halfway feature fusion produces better results; [14] introduced an auxiliary segmentation task on the basis of halfway feature fusion for further performance improvements. [29,27] applied attention mechanisms to adaptively weigh the visible and thermal features in the feature fusion stage; [26,32] alleviated the inconsistency between visible and thermal features to facilitate the optimization process of a dualmodality network. Apart from these studies on featurelevel fusion, multiple decision-level fusion methods were suggested: [6,15] used illumination information to guide the fusion of predictions (decisions) from visible/thermal images or from day/night sub-networks; [30] discussed a confidence-aware fusion mechanism, where the disagreement between visible and thermal predictions is used to reweigh visible contributions, which could also be regarded as a decision-level fusion approach.…”
Section: Multispectral Scene Analysismentioning
confidence: 99%
“…Multispectral information fusion methods can be categorized into: image-level fusion, feature-level fusion and decision-level fusion. Architectures that implement a feature-level fusion, usually within a two-stream network architecture (one dedicated to each source), have been proven to outperform the other strategies, and are currently the most studied in the literature [18,13,14,29,27,32,26,7,21,8]. However, the computational overhead provided by two-stream networks is huge, which is particularly undesirable for software deployment on embedded devices.…”
Section: Introductionmentioning
confidence: 99%