Deep learning has shown superiority in change detection (CD) tasks, most notably the Transformer architecture, whose self-attention mechanism captures long-range dependencies and outperforms traditional models. This capability gives the Transformer a significant advantage in capturing global-level features of complex object changes in high-resolution remote sensing images. Although Transformers are mature in natural language processing (NLP), their application to computer vision, and to CD tasks in particular, is still nascent, and current Transformer-based CD methods exhibit limitations, especially under varied lighting and seasonal changes. To address this, we propose VisionTwinNet, a two-stage strategy. First, Gated EnhanceClearNet, a specially designed deep network, reduces image noise and enhances brightness while preserving shadows and correcting color distortions. With its gating mechanism, the network adaptively adjusts the importance of features, yielding superior performance across a variety of remote sensing image degradations. Second, we develop Hybrid Light-Robust CDNet, a hybrid, robust, lightweight network custom-designed for CD in remote sensing images. This module deeply integrates the strengths of CNNs and Transformers and introduces an innovative attention design that optimizes the key and value dimensions separately, rather than adopting a single traditional linear transformation, ensuring efficient detection. Specifically, the LR-Transformer Block employs a lightweight multi-head self-attention mechanism that improves computational efficiency while providing richer feature representations. Comparative studies against six CD methods on three public datasets validate VisionTwinNet's robustness and efficacy; our approach notably reduces algorithmic complexity and improves model efficiency.
INDEX TERMS Automatically adjustable framework, change detection, deep learning, multi-scale feature extraction, transformer.
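The abstract states that Gated EnhanceClearNet uses a gating mechanism to adaptively adjust the importance of features. The exact layer definitions are not given here; the following is a minimal NumPy sketch of the general idea, where a learned sigmoid gate reweights each channel of a feature map. The function name `gated_feature` and the weight shapes are hypothetical, not the paper's actual architecture.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gated_feature(feat, Wg, bg):
    """Adaptively reweight features with a learned gate (illustrative only).

    feat: (n, d) feature vectors; Wg: (d, d) gate weights; bg: (d,) gate bias.
    The gate lies in (0, 1), so each channel is scaled by its learned importance.
    """
    gate = sigmoid(feat @ Wg + bg)   # (n, d), one importance weight per channel
    return feat * gate               # element-wise modulation of the features

# Toy usage with random weights standing in for learned parameters.
rng = np.random.default_rng(0)
feat = rng.standard_normal((8, 32))
Wg = rng.standard_normal((32, 32)) / np.sqrt(32)
bg = np.zeros(32)
out = gated_feature(feat, Wg, bg)
print(out.shape)  # (8, 32)
```

Because the gate is bounded in (0, 1), the mechanism can only attenuate channels, which is one simple way such a network could suppress noise-dominated features while passing informative ones through.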
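The LR-Transformer Block is described as a lightweight multi-head self-attention that optimizes the key/value dimensions separately instead of a single linear transformation. As a rough sketch of what decoupled key/value dimensions look like, here is standard multi-head self-attention in NumPy where the per-head query/key width `d_qk` and value width `d_v` are independent hyperparameters; all names and dimension choices are assumptions for illustration, not the paper's implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def light_mhsa(x, heads, d_qk, d_v, rng):
    """Multi-head self-attention with decoupled key/value widths (sketch).

    x: (n, d) tokens. Choosing d_qk < d_v (or both < d // heads) shrinks the
    projection matrices, which is one way to reduce attention cost.
    """
    n, d = x.shape
    Wq = rng.standard_normal((d, heads * d_qk)) / np.sqrt(d)
    Wk = rng.standard_normal((d, heads * d_qk)) / np.sqrt(d)
    Wv = rng.standard_normal((d, heads * d_v)) / np.sqrt(d)
    Wo = rng.standard_normal((heads * d_v, d)) / np.sqrt(heads * d_v)
    # Separate projections: queries/keys and values get their own widths.
    q = (x @ Wq).reshape(n, heads, d_qk).transpose(1, 0, 2)  # (h, n, d_qk)
    k = (x @ Wk).reshape(n, heads, d_qk).transpose(1, 0, 2)  # (h, n, d_qk)
    v = (x @ Wv).reshape(n, heads, d_v).transpose(1, 0, 2)   # (h, n, d_v)
    attn = softmax(q @ k.transpose(0, 2, 1) / np.sqrt(d_qk)) # (h, n, n)
    out = (attn @ v).transpose(1, 0, 2).reshape(n, heads * d_v)
    return out @ Wo                                          # back to (n, d)

rng = np.random.default_rng(1)
x = rng.standard_normal((16, 64))
y = light_mhsa(x, heads=4, d_qk=8, d_v=16, rng=rng)
print(y.shape)  # (16, 64)
```

With `heads=4`, `d_qk=8`, `d_v=16`, the combined Q/K/V projections use fewer parameters than the conventional choice of `d_qk = d_v = d // heads = 16` for all three, which illustrates the kind of trade-off a "lightweight" attention design exploits.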