High‐resolution (HR) crack images offer more detailed information for assessing structural conditions compared to low‐resolution (LR) images. This wealth of detail proves indispensable in bolstering the safety of unmanned aerial vehicle (UAV)‐based inspection procedures and elevating the precision of small crack segmentation. Nonetheless, achieving a balance between segmentation accuracy and GPU memory consumption poses a substantial challenge for deep learning models when processing HR crack images. To overcome this challenge, a novel “HR crack segmentation framework” (HRCSF) is proposed, specifically designed to meticulously segment crack images with resolutions exceeding 4K. First, a multiscale crack feature extraction network (MsCFEN) was proposed with the embedment of the strip pooling operation to enhance the representation of the transverse and longitudinal crack pixels from the complex backgrounds. Subsequently, two cascaded operations were tailored to MsCFEN, enabling a comprehensive refinement process that incorporates both global and local aspects. Furthermore, to fully leverage the potential of each proposed component in the refinement process, the complete architecture was trained using a loss function with embedded boundary optimization. Conclusively, a UAV‐based case study was conducted on a real bridge in Changsha, demonstrating HRCSF's practicability in segmenting HR crack images. The implementation of HRCSF allows the UAV to perform crack inspection effectively from a distance of 3 m away from the girder, resulting in a significant 50% reduction in inspection time compared to LR segmentation methods while maintaining high detection accuracy.