Defects in various products are unavoidable because of measurement errors and equipment accuracy limitations in the production process. Recent advances in metal surface defect detection have focused on optimizing traditional methods, developing new detection techniques, and exploring deep learning-based algorithms, providing technological support to improve metal manufacturing quality and production efficiency. To ensure the highest yield rate and meet production requirements, all products must undergo defect inspections before leaving the factory. However, Traditional methods for detecting metal surface defects require a lot of manual involvement, are difficult to accurately detect small defects, are susceptible to environmental interference, and lack stability and reliability. To address this issue, we propose the MeDERT model for metal surface defect detection. Our approach involves a new Spansensitive Texture Fusion (STF) module structure that focuses on multi-headed attention modules to recover lost details and boost inspection speed and on top of that use the Jump-sensitive detail recovery feature fusion module to ensure the validity of the extracted textures. Additionally, we introduce singular value decomposition and pretzel noise to model the noise and enhance model robustness through data augmentation. Our MeDERT model achieved state-of-the-art (SOTA) results on a specified dataset, demonstrating its effectiveness in enhancing inspection efficiency and accuracy.