“…Moreover, datasets from rhizosphere soils were not used in this study because they are extensively and dynamically affected by plant roots (Zhalnina et al, 2018) and are not representative of soil microbial communities. In total, we collected 1445 datasets, listed in Table S1 (Angle et al, 2017; Bahram et al, 2018; Berkelmann et al, 2020; Black and Just, 2018; Cania et al, 2019; Cha et al, 2021; Chen et al, 2019; Chu et al, 2018; Crits-Christoph et al, 2018; Hartman et al, 2017; Huber et al, 2018; Jiang et al, 2018; Johnston et al, 2016; Li et al, 2018, 2020; Links et al, 2021; Liu et al, 2018; Ma et al, 2018; Neal et al, 2021; Nelkner et al, 2019; Orellana et al, 2018; Ouyang and Norton, 2020; Paungfoo-Lonhienne et al, 2017; Romanowicz et al, 2021; Sukhum et al, 2021; Suttner et al, 2020; Hang Wang et al, 2021; Wang et al, 2020; Woodcroft et al, 2018; Wu et al, 2021; Xiao et al, 2016; Xue et al, 2019; Yu et al, 2019; Yurgel et al, 2019; Zhang et al, 2019; Zheng et al, 2021). The latitude and longitude of each sampling site were obtained from public databases [the INSDC BioSamples database (Courtot et al, 2022) and MG-RAST] and verified against the descriptions in each publication.…”