Fast and accurate estimation and spatial mapping of soil organic matter (SOM) are important for cultivated land planning and management, crop growth monitoring, and soil carbon pool estimation. A key problem is how to construct a fast and efficient estimation model from hyperspectral remote sensing imagery so that SOM can be inverted and mapped over large areas. Estimation accuracy is often limited by hyperspectral image quality and by the number of soil samples available during model construction. To address this, this study explored a method for constructing an SOM content estimation model based on a new stacking ensemble learning algorithm and hyperspectral images. Surface soil samples were collected in Huangzhong County, Qinghai Province, and the corresponding ZY1-02D hyperspectral remote sensing images were analyzed. A feature band dataset was constructed as model input using the Pearson correlation coefficient and the successive projections algorithm. Based on this dataset, a new SOM estimation model was developed under a stacking ensemble learning framework with heterogeneous models by optimizing the combination of base and meta-learners. Finally, the spatial distribution of SOM over the study area was mapped from the model output. The results suggest that constructing the feature band dataset improves the quality of the model's input data. The multi-class ensemble learning estimation model with the optimized combination of base and meta-learners has better predictive performance and stability than single-algorithm models and single-level ensemble models with homogeneous learners. The coefficient of determination is 0.829, the residual predictive deviation is 2.85, and the root mean square error of the prediction set is 1.953.
The results can provide new ideas for estimating SOM content using hyperspectral images and ensemble learning algorithms, and serve as a reference for mapping large-scale SOM spatial distribution using space-borne hyperspectral images.
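For readers unfamiliar with the three accuracy measures reported above, the following minimal sketch shows one common way to compute them. It makes several assumptions not stated in the abstract: the coefficient of determination is taken as 1 minus the ratio of residual to total sum of squares, the root mean square error divides by the number of prediction samples, and the residual predictive deviation is taken as the sample standard deviation of the observed values divided by that error. The function name `evaluate_predictions` and the sample values are hypothetical and purely illustrative; they are not the study's data.

```python
import math
from statistics import mean, stdev


def evaluate_predictions(observed, predicted):
    """Return R^2, RMSE and RPD for paired observed/predicted values.

    Assumed definitions (not confirmed by the source):
      R^2  = 1 - SS_res / SS_tot
      RMSE = sqrt(SS_res / n)
      RPD  = sample standard deviation of observed / RMSE
    """
    if len(observed) != len(predicted):
        raise ValueError("observed and predicted must have the same length")
    if len(observed) < 2:
        raise ValueError("at least two samples are required")

    obs_mean = mean(observed)
    # Residual and total sums of squares.
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - obs_mean) ** 2 for o in observed)
    if ss_res == 0 or ss_tot == 0:
        raise ValueError("metrics are undefined for perfect or constant data")

    rmse = math.sqrt(ss_res / len(observed))
    return {
        "r2": 1.0 - ss_res / ss_tot,
        "rmse": rmse,
        "rpd": stdev(observed) / rmse,
    }


# Illustrative values only (hypothetical, not from the study).
observed_som = [10.0, 20.0, 30.0, 40.0]
predicted_som = [12.0, 18.0, 33.0, 39.0]
metrics = evaluate_predictions(observed_som, predicted_som)
print({name: round(value, 3) for name, value in metrics.items()})
# prints {'r2': 0.964, 'rmse': 2.121, 'rpd': 6.086}
```

Other conventions exist, for example computing the deviation ratio with the population rather than the sample standard deviation, so values from this sketch are only comparable with published figures when the same definitions are used.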