Abstract. Advancements in remote sensing technology have significantly contributed to the improvement of models for estimating terrestrial gross primary productivity (GPP). However, discrepancies in the spatial distribution and interannual variability within GPP datasets pose challenges to a comprehensive understanding of the terrestrial carbon cycle. In contrast to previous models that rely on remote sensing and environmental variables, we developed an ensemble model based on the random forest method (denoted ERF model). This model used GPP outputs from established models: Eddy Covariance Light Use Efficiency (EC-LUE), GPP estimate model based on Kernel Normalized Difference Vegetation Index (GPP-kNDVI), GPP estimate model based on Near-Infrared Reflectance of Vegetation (GPP-NIRv), Revised-EC-LUE, Vegetation Photosynthesis Model (VPM), and GPP estimate model based on the Moderate Resolution Imaging Spectroradiometer (MODIS). These outputs were used as inputs to estimate GPP. The ERF model demonstrated superior performance, explaining 85.1 % of the monthly GPP variations at 170 sites and surpassing the performance of selected GPP estimate models (67.7 %–77.5 %) and an independent random forest model using remote sensing and environmental variables (81.5 %). Additionally, the ERF model improved accuracy across each month and with various subranges, mitigating the issue of “high-value underestimation and low-value overestimation” in GPP estimates. Over the period from 2001 to 2022, the global GPP estimated by the ERF model was 132.7 PgC yr−1, with an increasing trend of 0.42 PgC yr−2, which is comparable to or slightly better than the accuracy of other mainstream GPP datasets in terms of validation results of GPP observations independent of FLUXNET (i.e., ChinaFLUX). Importantly, for a growing number of GPP datasets, our study provides a way to integrate these GPP datasets, which may lead to a more reliable estimate of global GPP.