Summary
Because of its small size, low local contrast, and much interference, the field image of fine‐grained equipment taken from power transmission line surveillance is hard to be sustained by the traditional small target detection technique, which requires the manual extraction of features, making it difficult to accurately detect micro‐fine‐grained equipment. The deep learning‐based algorithms have prospective application but require abundant data to guarantee performance and tackle the problem of foreground–background imbalance. This paper develops an effective pipeline, i.e., limited sliding network (LSNet), to detect the small and fine‐grained defects on equipment in power transmission line infrastructure. The model firstly performs the regional analysis on the entire image to determine the potential target locations. The feature extraction and classification on the potential location image blocks are further performed by the VGG‐style model for the dense target locations, and the nonmaximum suppression method is finally applied to locate the target. On the other hand, a specific training method is also developed to better deal with a wide range imbalances of positive and negative samples. The proposed method achieves the detection mean average precision (mAP) rate of 98.66% on the real datasets, while limiting the computational overhead of hardware.