Visual interpretation is used to identify and extract collapses on remote sensing images time-consuming and laborious. In order to solve this problem, this paper takes Liuhe County, Jilin Province, as an example, and adopts a deep learning approach to achieve automatic extraction of collapse using high-resolution remote sensing images. The classical semantic segmentation models such as DeepLabV3+, PSP-Net and U-Net built with Pytorch framework are used, and after adjusted the models and modified the hyperparameters to make them fit the collapse extraction task, a high-definition collapse remote sensing dataset has been built for training and validation. The U-Net model, which backbone is VGG-16, was found to have the best extraction effect after comparison. In order to extract the collapses in remote sensing image slices more accurately, a Pyramid U-Net model for collapse extraction is proposed by integrating the pyramid pooling module on the basis of U-Net model to enhance the multi-scale feature extraction capability of the model. The results show that the Mean Intersection over Union (MIoU) and Mean Pixel Accuracy (MPA) of thePyramid U-Net model proposed in this paper reach 86.68% and 93.11%, respectively. Compared with the traditional U-Net model, Pyramid U-Net improves the MIoU and MPA values by 0.78% and 0.15%, respectively, and further improve the effect of collapse extraction.