2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2021
DOI: 10.1109/iros51168.2021.9636084
|View full text |Cite
|
Sign up to set email alerts
|

FEANet: Feature-Enhanced Attention Network for RGB-Thermal Real-time Semantic Segmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
33
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 66 publications
(33 citation statements)
references
References 18 publications
0
33
0
Order By: Relevance
“…In contrast, our proposed CMX solution, enabling comprehensive interactions from various perspectives, generalizes smoothly in RGB-T semantic segmentation. It can be seen that our method based on SegFormer-B2 achieves mIoU of 58.2%, clearly outperforming the previous best RGB-T methods ABM-DRNet [10], FEANet [17], and GMNet [19]. Our CMX solution based on SegFormer-B4 further elevates state-of-the-art mIoU to 59.7%, widening the accuracy gap in contrast to existing methods.…”
Section: Results On Rgb-thermal Datasetmentioning
confidence: 80%
See 4 more Smart Citations
“…In contrast, our proposed CMX solution, enabling comprehensive interactions from various perspectives, generalizes smoothly in RGB-T semantic segmentation. It can be seen that our method based on SegFormer-B2 achieves mIoU of 58.2%, clearly outperforming the previous best RGB-T methods ABM-DRNet [10], FEANet [17], and GMNet [19]. Our CMX solution based on SegFormer-B4 further elevates state-of-the-art mIoU to 59.7%, widening the accuracy gap in contrast to existing methods.…”
Section: Results On Rgb-thermal Datasetmentioning
confidence: 80%
“…We embed both modalities' features RGB in ∈ R H×W×C and X in ∈ R H×W×C along the spatial axis into two attention vectors W C RGB ∈ R C and W C X ∈ R C . Different from previous channel-wise attention methods [8], [17], [98], which use global average pooling to obtain the attention vector, we apply both global max pooling and global average pooling to RGB in and X in along the channel dimension to remain more information. We concatenate the four resulted vectors, having Y ∈ R 4C .…”
Section: Cross-modal Feature Rectificationmentioning
confidence: 99%
See 3 more Smart Citations