Abstract.Comparing model output and observed data is an important step for assessing model performance and quality of simulation results. However, such comparisons are often hampered by differences in spatial scales between local point observations and large-scale simulations of grid-cells or pixels. In this study, we propose a generic approach for a pixel-topoint comparison that accounts for the uncertainty resulting from landscape variability and measurement errors in ecosystem 30 variables, and provide statistical measures. The basic concept of our approach is to determine the statistical properties of small-scale (within-pixel) variability and observational errors, and to use this information to correct for their effect when large-scale area averages (pixel) are compared to small-scale point estimates. We demonstrate our approach by comparing simulated values of aboveground biomass, woody productivity (woody net primary productivity, NPP) and residence time of woody biomass from four dynamic global vegetation models (DGVMs) with measured inventory data from permanent plots 35 in the Amazon rainforest, a region with the typical problem of low data availability, a scale mismatch and high model uncertainty. We find that the DGVMs under-and overestimate aboveground biomass by 25% and up to 60%, respectively.Geosci. Model Dev. Discuss., https://doi.org/10.5194/gmd-2018-182 Manuscript under review for journal Geosci. Model Dev. Discussion started: 27 July 2018 c Author(s) 2018. CC BY 4.0 License.
2Our comparison metrics provide a quantitative measure for model-data agreement and show moderate to good agreement with the region-wide spatial biomass pattern detected by plot observations. However, all four DGVMs overestimate woody productivity and underestimate residence time of woody biomass even when accounting for the large uncertainty range of the observational data. This is because DGVMs do not represent the relation between productivity and residence time of woody biomass correctly. Thus, the DGVMs may simulate the correct large-scale patterns of biomass but for the wrong 5 reasons. We conclude that more information about the underlying processes driving biomass distribution are necessary to improve DGVMs. Our approach provides robust statistical measures for any pixel-to-point comparison, which is applicable for evaluation of models and remote sensing products.