“…The causes of multi-model diversity highlighted in previous studies (Young et al, 2018;Mortier et al, 2020;Griffiths et al, 2021) can also be elucidated using machine learning. There is an increase in the availability of globally gridded fused model-observation data products (e.g., Randles et al, 2017;Buchard et al, 2017;Inness et al, 2019;Betancourt et al, 2021;van Donkelaar et al, 2021;Betancourt et al, 2022) that can be used as benchmarks in model evaluation of atmospheric composition. Novel aspects of such benchmarks include providing data relevant to health impacts (e.g., DeLang et al, 2021) and using machine learning techniques for global mapping of atmospheric composition (e.g., Betancourt et al, 2022).…”