Aiming at the real-time perception problem of key target objects caused by harsh environmental factors of high dust, low illumination, motion blur, and multi-target mixing in the comprehensive excavation working face of coal mine, a multi-target detection and tracking algorithm based on DDEB-YOLOv5s and StrongSORT is proposed. First, the YOLOv5s model is improved by using C3-Dense module, decoupled head, ECIoU loss function, and weighted bi-directional feature pyramid network to enhance the detection performance of the model in complex backgrounds of coal mine and complete the design of the DDEB-YOLOv5s multi-target detection network. Second, the DDEB-YOLOv5s algorithm is used as a detector and combined with the StrongSORT tracking algorithm to track critical equipment and miners in the complex context of coal mine. Experiments were conducted on the dataset of comprehensive excavation working face, and the experimental results show that the proposed DDEB-YOLOv5s has the best integrated detection performance compared with other YOLO series target detection algorithms, and its mean value of detection accuracy reaches 91.7%, which is 4.9% higher than that of the original YOLOv5s model. In addition, compared to the three tracking models, (YOLOv7-tiny)-(BoT-SORT), YOLOv5s-DeepSORT, and YOLOv8s-Bytetrack, the (DDEB-YOLOv5s)-StrongSORT model has the best tracking performance (with a mean tracking accuracy of 94.2%) and the least number of identifier switches. Finally, the real-time perception method proposed in this study for the key target of the coal mine working face can provide new technical support and effective guarantee for coal mine safety production.