Based on the medium-resolution Landsat TM and OLI satellite images in the study area, the deep learning ENVINet-5 model is adopted for vegetation coverage monitoring. By referring to the fusion image and Google Earth high-resolution satellite image, the training samples and verification samples are manually labeled, and the labels of four types of ground objects (desert, water body, cultivated land, and construction land) are made. Through the ENVI deep learning binary classification model, the labeled training samples are trained, and a large number of samples of desert, water, and cultivated land are extracted and transformed into corresponding label images. Then, a large number of training sample labels extracted from the model are combined with the manually made construction land sample labels and both of them are used as the training samples of the ENVI deep learning multiclassification model. According to the classification process of the deep learning model (creating label image, initializing training model, and training model and model classification), through the adjustment of various parameters, the four types of ground objects in the study area are finally classified. Finally, the classification results that meet the accuracy requirements are statistically analyzed. It is proved that the model classification results can meet the use requirements.