RGB-T Semantic Segmentation With Location, Activation, and Sharpening

Li, Gongyang; Wang, Yike; Li, Zhi; Zhang, Xinpeng; Zeng, Dan

doi:10.1109/tcsvt.2022.3208833

Cited by 54 publications

(8 citation statements)

References 62 publications


“…Data augmentation is applied to the dataset used during training, including random flipping and cropping operations. To verify the semantic segmentation performance of the proposed method, this section compares it with six other semantic segmentation methods: BiSeNet [31], RTFNet [23], FuseSeg [24], GMNet [32], ABMDRNet [33], and LASNet [34]. Among them, BiSeNet is a semantic segmentation algorithm based on a single natural light image, while the others are based on the fusion of infrared and natural light images.…”

Section: Experiments Results (mentioning)

confidence: 99%
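The statement above mentions random flipping and cropping as training augmentation. For paired infrared and natural light inputs, one plausible reading is that the same random flip and crop window must be applied to both images and to the label map so that pixels stay aligned. The sketch below illustrates that idea on nested lists; the function name `paired_flip_and_crop`, its parameters, and the 0.5 flip probability are assumptions for illustration, not details taken from the citing paper.

```python
import random


def paired_flip_and_crop(rgb, thermal, label, crop_h, crop_w, rng=None):
    """Apply one shared random horizontal flip and crop to three aligned 2D grids.

    Hypothetical helper: each input is a list of rows with identical height
    and width. The same flip decision and crop window are reused for all
    three inputs so that pixel correspondence is preserved.
    """
    rng = rng or random.Random()
    height, width = len(label), len(label[0])
    if crop_h > height or crop_w > width:
        raise ValueError("crop size exceeds image size")

    # Draw the random parameters once, then reuse them for every input.
    flip = rng.random() < 0.5  # assumed flip probability
    top = rng.randint(0, height - crop_h)
    left = rng.randint(0, width - crop_w)

    def transform(image):
        rows = [list(reversed(row)) if flip else list(row) for row in image]
        return [row[left:left + crop_w] for row in rows[top:top + crop_h]]

    return transform(rgb), transform(thermal), transform(label)
```

Drawing the flip decision and crop offsets once, outside `transform`, is the design point: sampling them separately per modality would misalign the infrared image, the natural light image, and the labels.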

“…GMNet [32] only uses a deep feature fusion module to segment semantic regions, so its IOU metric is not good enough. ABMDRNet [33] and LASNet [34] employ different strategies to handle the fusion information of low-level and high-level data, so they obtained good results. In our method, distinct information is extracted for different layers, and the extracted high-level and global information is densely integrated into the restoration process of the original resolution of low-level information for each layer, so our method obtains the best result.…”

Section: Experiments Results (mentioning)

confidence: 99%

“…BiSeNet [31] only uses natural light images to segment, so it achieved a bad result. RTFNet [23] and FuseSeg [24] only use element-wise summation and down-sampling/up-sampling to fuse different modal images; they treat low-level and high-level information in the same way, thus achieving relatively moderate results. GMNet [32] only uses a deep feature fusion module to segment semantic regions, so its IOU metric is not good enough. ABMDRNet [33] and LASNet [34] employ different strategies to handle the fusion information of low-level and high-level data, so they obtained good results. In our method, distinct information is extracted for different layers, and the extracted high-level and global information is densely integrated into the restoration process of the original resolution of low-level information for each layer, so our method obtains the best result.…”

Section: Experiments Results (mentioning)

confidence: 99%


Semantic Segmentation Algorithm Fusing Infrared and Natural Light Images for Automatic Navigation in Transmission Line Inspection

Yuan, Wang, Huo et al. 2023. Electronics

Unmanned aerial vehicles (UAVs) are now widely used in power transmission line inspection, where they need to navigate automatically by recognizing the category and accurate position of transmission pylon equipment. Semantic segmentation is an effective method for recognizing transmission pylon equipment. In this paper, a semantic segmentation algorithm that fuses infrared and natural light images is proposed. A cross-modal attention interaction activation mechanism is adopted to fully exploit the complementarity between natural light and infrared images. First, a global information block with a feature pyramid structure is used to deeply mine and fuse the multi-scale global contextual information of the fused features. The block is then used for feature aggregation during decoding, and sufficient aggregation of the multi-scale features of infrared and natural light images enhances the expressive ability of the model and improves the accuracy of semantic segmentation of transmission pylon equipment in complex scenes. Our method guides low-level up-sampling and restoration with denser global and high-level features. Experimental results on a dataset of transmission pylon equipment that we collected show that the proposed method achieves better semantic segmentation results than the state-of-the-art methods.


“…
Methods          Modalities      % mIoU
GMNet [51]       RGB-Infrared    49.2
LASNet [52]      RGB-Infrared    42.5
EGFNet [41]      RGB-Infrared    47.3
FEANet [42]      RGB-Infrared    46.8
DIDFuse [43]     RGB-Infrared    50.6
ReCoNet [44]     RGB-Infrared    50.9
U2Fusion [53]    RGB-Infrared    47.9
TarDAL [54]      RGB-Infrared    48.1
SegMiF [9]       RGB-Infrared    54.8
U3M (Ours)       RGB-Infrared    60.8

models utilizing RGB-Infrared modalities, a combination critical for enhancing material differentiation under varying illumination conditions. Notably, our model, U3M, achieves an impressive mIoU score of 60.8, which surpasses all other models listed.…”

Section: Methods (mentioning)

confidence: 99%
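The statement above ranks methods by mIoU (mean intersection over union). As a reminder of what that number measures, the following is a minimal sketch of the standard definition: per-class IoU averaged over the classes that occur in either the prediction or the ground truth. The function name `mean_iou` and the choice to skip absent classes are assumptions; the evaluation protocol used in the cited comparison (ignored labels, per-image versus dataset-level accumulation) is not stated in the excerpt.

```python
def mean_iou(pred, target, num_classes):
    """Mean IoU over classes for two 2D grids of integer class labels.

    Hypothetical helper: `pred` and `target` are lists of rows with the same
    shape. Classes absent from both grids are left out of the average.
    """
    intersection = [0] * num_classes
    union = [0] * num_classes

    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p == t:
                # Pixel counts toward both intersection and union of class p.
                intersection[p] += 1
                union[p] += 1
            else:
                # Pixel belongs to the union of both classes involved.
                union[p] += 1
                union[t] += 1

    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    return sum(ious) / len(ious) if ious else float("nan")


# Toy usage: class 0 has IoU 1/2 and class 1 has IoU 2/3.
score = mean_iou([[0, 0], [1, 1]], [[0, 1], [1, 1]], num_classes=2)
```

Reported figures such as 60.8 are this quantity expressed as a percentage, accumulated over a whole test set rather than a single image.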

Cross-Domain and Cross-Modal Knowledge Distillation in Domain Adaptation for 3D Semantic Segmentation

Zhang, Xie et al. 2022. Proceedings of the 30th ACM International Conference on Multimedia

Multimodal semantic segmentation is a pivotal component of computer vision and typically surpasses unimodal methods by utilizing rich information from various sources. Current models frequently adopt modality-specific frameworks that are inherently biased toward certain modalities. Although these biases might be advantageous in specific situations, they generally limit the adaptability of the models across different multimodal contexts, thereby potentially impairing performance. To address this issue, we leverage the inherent capabilities of the model itself to discover the optimal equilibrium in multimodal fusion and introduce U3M: An Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation. Specifically, this method involves an unbiased integration of multimodal visual data. Additionally, we employ feature fusion at multiple scales to ensure the effective extraction and integration of both global and local features. Experimental results demonstrate that our approach achieves superior performance across multiple datasets, verifying its efficacy in enhancing the robustness and versatility of semantic segmentation in diverse settings. Our code is available at U3M-multimodal-semantic-segmentation.


“…Images shot under low-light or backlit conditions are visually terrible for viewers and also degrade the performance of downstream vision tasks, such as action recognition [1], [2], object detection [3], [4], and semantic segmentation [5], [6]. Many efforts have been made to increase the visibility of these images and ameliorate their low quality, including upgrading imaging devices and designing image enhancement algorithms.…”

Section: Introduction (mentioning)

confidence: 99%

Low-Light Image Enhancement by Deep Learning Network for Improved Illumination Map

Wang, Zhang 2023. Preprint
