Image detection technology is of paramount importance across various fields. This significance is not only seen in general images with everyday scenes but also holds substantial research value in the field of remote sensing. Remote sensing images involve capturing images from aircraft or satellites. These images typically feature diverse scenes, large image formats, and varying imaging heights, thus leading to numerous small-sized targets in the captured images. Accurately identifying these small targets, which may occupy only a few pixels, is a challenging and active research area. Current methods mainly fall into two categories: enhancing small target features by improving resolution and increasing the number of small targets to bolster training datasets. However, these approaches often fail to address the core distinguishing features of small targets in the original images, thus resulting in suboptimal performance in fine-grained classification tasks. To address this situation, we propose a new network structure DDU (Downsample Difference Upsample), which is based on differential and resolution changing methods in the Neck layer of deep learning networks to enhance the recognition features of small targets, thus further improving the feature richness of recognition and effectively solving the problem of low accuracy in small target object recognition. At the same time, in order to take into account the recognition effect of targets of other sizes in the image, a new attention mechanism called PNOC (protecting the number of channels) is proposed, which integrates small target features and universal object features without losing the number of channels, thereby increasing the accuracy of recognition. And experimental verification was conducted on the PASCAL-VOC dataset. At the same time, it was applied to the testing of the fine-grained MAR20 dataset and found that the performance was better than other classic algorithms. At the same time, because the proposed framework belongs to a one-stage detection method, it has good engineering applicability and scalability, and universality in scientific research applications are good. Through comparative experiments, it was found that our algorithm improved the performance of the mAP by 0.7% compared to the original YOLOv8 algorithm.