In multi-source fusion deformation-monitoring methods that utilize fiber Bragg grating (FBG) data and other data types, the lack of FBG constraint points in edge regions often results in inaccuracies in fusion results, thereby impacting the overall deformation-monitoring accuracy. This study proposes a multi-source fusion deformation-monitoring calibration method and develops a calibration model that integrates vision and FBG multi-source fusion data. The core of this model is a normal distribution transform (NDT)–convolutional neural network (CNN)–self-attention (SA) calibration network. This network enhances continuity between points in point clouds using the NDT module, thereby reducing outliers at the edges of the fusion results. Experimental validation shows that this method reduces the absolute error to below 0.2 mm between multi-source fusion calibration results and high-precision measured point clouds, with a confidence interval of 99%. The NDT-CNN-SA network offers significant advantages, with a performance improvement of 36.57% over the CNN network, 14.39% over the CNN–gated recurrent unit (GRU)–convolutional block attention module (CBAM) network, and 9.54% over the CNN–long short term memory (LSTM)–SA network, thereby demonstrating its superior generalization, accuracy, and robustness. This calibration method provides smoother and accurate structural deformation data, supports real-time deformation monitoring, and reduces the impact of assembly deviation on product quality and performance.