2022
DOI: 10.3390/rs14143392
|View full text |Cite
|
Sign up to set email alerts
|

Machine Learning-Based Approach Using Open Data to Estimate PM2.5 over Europe

Abstract: Air pollution is currently considered one of the most serious problems facing humans. Fine particulate matter with a diameter smaller than 2.5 micrometres (PM2.5) is a very harmful air pollutant that is linked with many diseases. In this study, we created a machine learning-based scheme to estimate PM2.5 using various open data such as satellite remote sensing, meteorological data, and land variables to increase the limited spatial coverage provided by ground-monitors. A space-time extremely randomised trees m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
6
2

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 17 publications
(9 citation statements)
references
References 45 publications
1
6
2
Order By: Relevance
“…In this study, we used a tree-based machine learning algorithm called the Extra Trees to estimate PM2.5 over CZ with a high spatial resolution of 1 km during 2018 ̶ 2020. In contrast to our prior study, which concentrated on the entire Europe [31], we discovered that incorporating more balanced data in terms of spatial and temporal distribution enhances the overall accuracy of the model and simplifies the modeling approach. The R2 obtained from the 10-fold cross-validation of the model developed specifically for CZ was 0.85, whereas the corresponding R2 for the model developed for the entire European region was 0.69 [31].…”
Section: Discussioncontrasting
confidence: 86%
See 1 more Smart Citation
“…In this study, we used a tree-based machine learning algorithm called the Extra Trees to estimate PM2.5 over CZ with a high spatial resolution of 1 km during 2018 ̶ 2020. In contrast to our prior study, which concentrated on the entire Europe [31], we discovered that incorporating more balanced data in terms of spatial and temporal distribution enhances the overall accuracy of the model and simplifies the modeling approach. The R2 obtained from the 10-fold cross-validation of the model developed specifically for CZ was 0.85, whereas the corresponding R2 for the model developed for the entire European region was 0.69 [31].…”
Section: Discussioncontrasting
confidence: 86%
“…ET reduces overfitting by introducing additional randomness during the construction of the trees and it uses the entire dataset while training without performing any pruning which decreases the required time for training compared to the RF that applies pruning techniques. A deeper explanation of this algorithm was provided in our previous work [25,31].…”
Section: Model Developmentmentioning
confidence: 99%
“…Spatial open datasets differ in quality and quantity and are used for various types of scientific and practical research in order to be exploited for various purposes, such as, for example, online applications [45], 3D city modelling [46], for collaborative geological mapping [47], for investigating historical settlements and landscape analysis [48], to study the geographic educational paths of individuals or social groups [49], for building citizen science [50], analysis of air pollution [51], vehicular traffic [52], public green spaces [53], etc. Special attention is given to the comparative analysis of open data among cities in compliance with indicator standards in order to set up a common set of indicators [54], such as economy, education, energy, environment and climate change, finance, governance, health, housing, population and social conditions, recreation, safety, solid waste, sport and culture, telecommunications, transportation, urban/local agriculture and food security, urban planning, wastewater and water.…”
Section: Discussionmentioning
confidence: 99%
“…10). Examples are the potential and realized distribution of 16 tree species (Bonannella et al, 2022), monthly airborne fine particulate matter levels (Ibrahim et al, 2022), 43 CORINE land cover classes (Witjes et al, 2022), and daily aerosol optical depth levels (Ibrahim et al, 2021).…”
Section: Future Workmentioning
confidence: 99%