2021
DOI: 10.5194/essd-2020-380
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

AQ-Bench: A Benchmark Dataset for Machine Learning on Global Air Quality Metrics

Abstract: Abstract. With the AQ-Bench dataset, we contribute to the recent developments towards shared data usage and machine learning methods in the field of environmental science. The dataset presented here enables researchers to relate global air quality metrics to easy-access metadata and to explore different machine learning methods for obtaining estimates of air quality based on this metadata. AQ-Bench contains a unique collection of aggregated air quality data from the years 2010–2014 and metadata at more than 55… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
5
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(5 citation statements)
references
References 3 publications
0
5
0
Order By: Relevance
“…The prediction method of O3 concentrations based on machine learning mainly relies on the advantage that the machine learning method can effectively capture the hidden nonlinear characteristics in the change in atmospheric composition and can build a prediction model of atmospheric composition through characteristic variables (Mo et al, 2019;Aljanabi et al, 2020;Amato et al, 2020;Betancourt et al, 2021). Machine learning models trained with data from observations or physical models can produce reliable simulations without intensive high-end computing (Ojha et al, 2021).…”
Section: A C C E P T E D Mmentioning
confidence: 99%
“…The prediction method of O3 concentrations based on machine learning mainly relies on the advantage that the machine learning method can effectively capture the hidden nonlinear characteristics in the change in atmospheric composition and can build a prediction model of atmospheric composition through characteristic variables (Mo et al, 2019;Aljanabi et al, 2020;Amato et al, 2020;Betancourt et al, 2021). Machine learning models trained with data from observations or physical models can produce reliable simulations without intensive high-end computing (Ojha et al, 2021).…”
Section: A C C E P T E D Mmentioning
confidence: 99%
“…The AQ-Bench dataset is available in .csv format at http://doi.org/10.23728/b2share. 30d42b5a87344e82855a486bf2123e9f (Betancourt et al, 2020). To enable a machine learning quick start on the AQ-Bench dataset with reproduction of the baseline experiments, we also provide an introductory Jupyter notebook on https:// gitlab.version.fz-juelich.de/esde/machine-learning/aq-bench (Betancourt et al, 2021).…”
Section: Data and Code Availabilitymentioning
confidence: 99%
“…Our ready-to-use, fully documented dataset is freely available under the DOI https://doi.org/10.23728/b2share. 30d42b5a87344e82855a486bf2123e9f (Betancourt et al, 2020). We also provide our baseline machine learning code at https://gitlab.version.fz-juelich.de/esde/machine-learning/ aq-bench (Betancourt et al, 2021), offering a low-threshold entrance to machine learning in environmental science within a relevant research topic.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Such CTMs can be combined with higher resolution dispersion models to provide local air quality levels in street canyons (Gidhagen et al, 2021 ). More recently, statistical methods, such as Machine Learning (ML) (Barré et al, 2021 ; Betancourt et al, 2021 ; Lovrić et al, 2021 ; Nitheesh et al, 2021 ; Rybarczyk and Zalakeviciute, 2021 ), have proven their efficiency and reliability in predicting the concentrations of pollutants in the atmosphere.…”
Section: Introductionmentioning
confidence: 99%