“…After comparing the results of all three machine learning techniques and the allometric upscaling, we chose the random forest (RF) model because it performed best on cross-validation samples (see section "Building predictive models" below and Supplement, Table S8), and we only used the RF for subsequent mapping and analysis. Using this RF model, we mapped the root biomass of an average tree over an area of ∼ 1 km × 1 km across the globe using as predictors gridded maps of shoot biomass (weight per area) (Santoro et al, 2021(Santoro et al, , 2018, tree height (Simard et al, 2011), soil nitrogen (Wang et al, 2014), pH (Wang et al, 2014), bulk density (Wang et al, 2014), clay content (Wang et al, 2014), sand content (Wang et al, 2014), base saturation (Wang et al, 2014), cation exchange capacity (Wang et al, 2014), water vapor pressure (Fick and Hijmans, 2017), mean annual precipitation (Fick and Hijmans, 2017), mean annual temperature (Fick and Hijmans, 2017), aridity (Trabucco and Zomer, 2019) and water table depth (Fan et al, 2013) (Supplement, Figs. S10-S12).…”