During the process of producing hot‐rolled strips in the metallurgical industry, various defects inevitably appear on its surface due to harsh environments and complex manufacturing, consequently bringing about quality problems and economic loss. However, the existing detection methods are difficult to meet the actual requirements of commercial production due to their problems, such as low efficiency and low accuracy. Herein, an improved You only look once X (YOLOX) model for detecting strip surface defects is proposed. Based on the existing YOLOX model, herein, the MobileViT block is introduced to enhance the capability of feature extraction of the backbone network output. The feature pyramid networks through efficient channel attention (ECA) module to strengthen important channel weights are improved, and finally, the original positioning loss function by efficient intersection over union (EIOU) to increase the locating accuracy is replaced. The experimental results show that the improved YOLOX model can obtain 80.67 mAP and 75.69 mAP detection effects on the Northeast University dataset and Xsteel surface defect dataset, respectively. Compared with the original YOLOX, the model increases by 3.95 mAP and 4.02 mAP, respectively. The data fully show that the improved YOLOX model proposed herein is more effective for strip surface defect detection.