A high-accuracy, continuous air temperature (Ta) dataset with high spatiotemporal resolution is essential for human health, disease prediction, and energy management. Existing datasets consider factors such as elevation, latitude, and surface temperature but insufficiently address meteorological and spatiotemporal factors, affecting accuracy. Additionally, no high-resolution dataset currently includes daily maximum (T
max
), minimum (T
min
), and mean (T
mean
) temperatures generated using a unified methodology. Here, we introduce the four-dimensional spatiotemporal deep forest (4D-STDF) model, integrating 12 multisource factors, encompassing static and dynamic parameters, and six refined spatiotemporal factors to produce Ta datasets. This approach generates three high-accuracy Ta datasets at 1 km spatial resolution covering mainland China from 2003 to 2022. These datasets, in GeoTIFF format with WGS84 projection, comprise daily T
max
, T
min
, and T
mean
. The overall RMSE are 1.49 °C, 1.53 °C, and 1.18 °C for the estimates. The 4D-STDF model can also be applied to other regions with sparse meteorological stations.