Timely and accurate detection of pavement cracks, the most common type of road damage, is essential for ensuring road safety. Automatic image segmentation of cracks can accurately locate their pixel positions. This paper proposes a Multi-Scale Feature Enhanced U-shaped Network (MFE-UNet) for pavement crack detection. This network model uses a Residual Detail-Enhanced Block (RDEB) instead of a conventional convolution in the encoder–decoder process. The block combines Efficient Multi-Scale Attention to enhance its feature extraction performance. The Multi-Scale Gating Feature Fusion (MGFF) is incorporated into the skip connections, enhancing the fusion of multi-scale features to capture finer crack details while maintaining rich semantic information. Furthermore, we created a pavement crack image dataset named China_MCrack, consisting of 1500 images collected from road surfaces using smartphone-mounted motorbikes. The proposed network was trained and tested on the China_MCrack, DeepCrack, and Crack-Forest datasets, with additional generalization experiments on the BochumCrackDataset. The results were compared with those of the U-Net model, ResUNet, and Attention U-Net. The experimental results show that the proposed MFE-UNet model achieves accuracies of 82.95%, 91.71%, and 69.02% on three datasets, namely, China_MCrack, DeepCrack, and Crack-Forest datasets, respectively, and the F1_score is improved by 1–4% compared with other networks. Experimental results demonstrate that the proposed method is effective in detecting cracks at the pixel level.