Water-soluble organic matter (WSOM) formed through aqueous
processes
contributes substantially to total atmospheric aerosol, however, the
impact of water evaporation on particle concentrations is highly uncertain.
Herein, we present a novel approach to predict the amount of evaporated
organic mass induced by sample drying using multivariate polynomial
regression and random forest (RF) machine learning models. The impact
of particle drying on fine WSOM was monitored during three consecutive
summers in Baltimore, MD (2015, 2016, and 2017). The amount of evaporated
organic mass was dependent on relative humidity (RH), WSOM concentrations,
isoprene concentrations, and NO
x
/isoprene
ratios. Different models corresponding to each class were fitted (trained
and tested) to data from the summers of 2015 and 2016 while model
validation was performed using summer 2017 data. Using the coefficient
of determination (R
2) and the root-mean-square
error (RMSE), it was concluded that an RF model with 100 decision
trees had the best performance (R
2 of
0.81) and the lowest normalized mean error (NME < 1%) leading to
low model uncertainties. The relative feature importance for the RF
model was calculated to be 0.55, 0.2, 0.15, and 0.1 for WSOM concentrations,
RH levels, isoprene concentrations, and NO
x
/isoprene ratios, respectively. The machine learning model was thus
used to predict summertime concentrations of evaporated organics in
Yorkville, Georgia, and Centerville, Alabama in 2016 and 2013, respectively.
Results presented herein have implications for measurements that rely
on sample drying using a machine learning approach for the analysis
and interpretation of atmospheric data sets to elucidate their complex
behavior.