Carbon dioxide (CO2) is currently the most harmful greenhouse gas in the atmosphere. Obtaining long-term, high-resolution atmospheric column CO2 concentration (XCO2) datasets is of great practical significance for mitigating the greenhouse effect, identifying and controlling carbon emission sources, and achieving carbon cycle management. However, mainstream satellite observations provide XCO2 datasets with coarse spatial resolution, which is insufficient to support the needs of higher-precision research. To address this gap, in this study, we integrate spatial information with the extreme random trees model and develop a new machine learning model called spatial extreme random trees (SExtraTrees) to reconstruct a 1 km spatial resolution XCO2 dataset for China from 2016 to 2020. The results indicate that the predictive ability of spatial extreme random trees is more stable and has higher fitting accuracy compared to other methods. Overall, XCO2 in China shows an increasing trend year by year, with the spatial distribution revealing significantly higher XCO2 levels in eastern coastal regions compared to western inland areas. The contributions of this study are primarily in the following areas: (1) Considering the spatial heterogeneity of XCO2 and combining spatial features with the advantages of machine learning, we construct the spatial extreme random trees model, which is verified to have high predictive accuracy. (2) Using the spatial extreme random trees model, we reconstruct high-resolution XCO2 datasets for China from 2016 to 2020, providing data support for carbon emission reduction and related decision making. (3) Based on the generated dataset, we analyze the spatiotemporal distribution patterns of XCO2 in China, thereby improving emission reduction policies and sustainable development measures.