We
developed Europe-wide models of long-term exposure to eight
elements (copper, iron, potassium, nickel, sulfur, silicon, vanadium,
and zinc) in particulate matter with diameter <2.5 μm (PM2.5) using standardized measurements for one-year periods between
October 2008 and April 2011 in 19 study areas across Europe, with
supervised linear regression (SLR) and random forest (RF) algorithms.
Potential predictor variables were obtained from satellites, chemical
transport models, land-use, traffic, and industrial point source databases
to represent different sources. Overall model performance across Europe
was moderate to good for all elements with hold-out-validation R-squared ranging from 0.41 to 0.90. RF consistently outperformed
SLR. Models explained within-area variation much less than the overall
variation, with similar performance for RF and SLR. Maps proved a
useful additional model evaluation tool. Models differed substantially
between elements regarding major predictor variables, broadly reflecting
known sources. Agreement between the two algorithm predictions was
generally high at the overall European level and varied substantially
at the national level. Applying the two models in epidemiological
studies could lead to different associations with health. If both
between- and within-area exposure variability are exploited, RF may
be preferred. If only within-area variability is used, both methods
should be interpreted equally.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.