In large-scale remote sensing scenarios characterized by intricate terrain, the straightforward road imaging features in synthetic aperture radar (SAR) images make them susceptible to interference from other elements such as ridges, compromising the robustness of conventional SAR image road extraction methods. This paper introduces a method that integrates Gaofen-3 (GF-3) with a resolution of 3.0 m, Digital Elevation Models (DEMs), and Gaofen-2 (GF-2) remote sensing image data with a resolution of 4.0 m, aiming to improve the performance of road extraction in complex terrain. Leveraging DEMs, this study addresses the limitations in feature-based SAR algorithms, extending their application to complex remote sensing scenarios. Decision-level fusion, integrating SAR and multispectral images, further refines road extraction precision. To overcome issues related to terrain interference, including fragmented road segments, an adaptive rotated median filter and graph-theory-based optimization are introduced. These advancements collectively enhance road recognition accuracy and topological precision. The experimental results validate the effectiveness of the multi-source remote sensing image fusion and optimization methods. Compared to road extraction from multispectral images, the F1-score of the proposed method on the test images increased by 2.18%, 4.22%, and 1.4%, respectively.