The tasseling stage of maize, as a critical period of maize cultivation, is essential for predicting maize yield and understanding the normal condition of maize growth. However, the branches overlap each other during the growth of maize seedlings and cannot be used as an identifying feature. However, during the tasseling stage, its apical ear blooms and has distinctive features that can be used as an identifying feature. However, the sizes of the maize tassels are small, the background is complex, and the existing network has obvious recognition errors. Therefore, in this paper, unmanned aerial vehicle (UAV) RGB images and an improved YOLOv8 target detection network are used to enhance the recognition accuracy of maize tassels. In the new network, a microscale target detection head is added to increase the ability to perceive small-sized maize tassels; In addition, Spatial Pyramid Pooling—Fast (SPPF) is replaced by the Spatial Pyramid Pooling with Efficient Layer Aggregation Network (SPPELAN) in the backbone network part to connect different levels of detailed features and semantic information. Moreover, a dual-attention module synthesized by GAM-CBAM is added to the neck part to reduce the loss of features of maize tassels, thus improving the network’s detection ability. We also labeled the new maize tassels dataset in VOC format as the training and validation of the network model. In the final model testing results, the new network model’s precision reached 93.6% and recall reached 92.5%, which was an improvement of 2.8–12.6 percentage points and 3.6–15.2 percentage points compared to the mAP50 and F1-score values of other models. From the experimental results, it is shown that the improved YOLOv8 network, with high performance and robustness in small-sized maize tassel recognition, can accurately recognize maize tassels in UAV images, which provides technical support for automated counting, accurate cultivation, and large-scale intelligent cultivation of maize seedlings.