Due to the limitations of the operating conditions and equipment status, in practice, the normal data for planetary gearboxes are often sufficient, but the fault data are scarce and difficult to obtain, causing the fault diagnosis to face not only an imbalance problem but also a small sample problem. It leads to serious misdiagnosis and poor generalizability. To overcome above challenges, an enhanced Siamese network with improved downsampling module is proposed to reduce the negative impact of small and imbalanced datasets on fault diagnosis. Firstly, the Siamese network is combined with data enhancement to adaptively increase the fault training data with different small sample degrees. It can make the number of training data of each class get rid of small sample, while avoiding the excessive use of scarce fault data causing additional overfitting, so as to improve the generalization of the model. Among them, the data enhancement adopts the extreme noise expansion method proposed in this work, which can quickly generate samples that meet the requirements and help to improve the performance of the model in noisy environments. Secondly, adding the improved downsampling module to the Siamese network. The improved downsampling module introduces density-based spatial clustering of applications with noise (DBSCAN) and distribution ratio calculation, which can consider the distribution range and density of data when selecting. It can not only balance the training data, but also avoid serious information loss, and then reduce the probability of misdiagnosis. Finally, using six imbalanced planetary gearbox datasets with different small sample degrees for verification. The experimental results show that the proposed method can efficiently increase and balance the training data, which greatly improves the accuracy of fault diagnosis under the condition of small and imbalanced samples, especially when the small sample degree is very serious, the accuracy is improved by 57.76% on average.