Grassland classification is crucial for grassland management. One commonly used method utilizes remote sensing vegetation indices (VIs) to map grassland classes at various scales. However, most grassland classifications were conducted as case studies in a small area due to lack of field data sources. At a small scale, classification is reliable; however, great uncertainty emerges when extended to other areas. In this study, large amounts of field observations (more than 30,000 aerial photos) were obtained using unmanned aerial vehicle photography in Inner Mongolia, China, during the peak period of grassland growth in 2018 and 2019. Then, four machine learning classification algorithms were constructed based on characteristic indices of MODIS NDVI in the growing season to map grassland classes of Inner Mongolia. Finally, the spatial distribution and temporal variation of temperate grassland classes were analyzed. Results showed that: (1) Among all characteristic indices, the maximum, average, and sum of MODIS NDVI from July to September during 2015 to 2019 greatly affected grassland classification. (2) The random forest method exhibited the best performance with overall accuracy and kappa coefficient being 72.17% and 0.62, respectively. (3) Compared with the grassland class mapped in the 1980s, 30.98% of grassland classes have been transformed. Our study provides a technological basis for effective and accurate classification of the temperate steppe class and a theoretical foundation for sustainable development and restoration of the temperate steppe ecosystem.