Abstract. Satellite remote sensing aerosol optical depth (AOD) and meteorological
elements were employed to invert PM2.5 (the fine particulate matter
with a diameter below 2.5 µm) in order to control air pollution
more effectively. This paper proposes a restricted gradient-descent linear
hybrid machine learning model (RGD-LHMLM) by integrating a random forest
(RF), a gradient boosting regression tree (GBRT), and a deep neural network
(DNN) to estimate the concentration of PM2.5 in China in 2019. The
research data included Himawari-8 AOD with high spatiotemporal resolution,
ERA5 meteorological data, and geographic information. The results showed
that, in the hybrid model developed by linear fitting, the DNN accounted for
the largest proportion, and the weight coefficient was 0.62. The R2
values of RF, GBRT, and DNN were reported as 0.79, 0.81, and 0.8,
respectively. Preferably, the generalization ability of the mixed model was
better than that of each sub-model, and R2 (determination coefficient)
reached 0.84, and RMSE (root mean square error) and MAE (mean absolute
error) were reported as 12.92 and 8.01 µg m−3, respectively. For the RGD-LHMLM, R2 was above 0.7
in more than 70 % of the sites and RMSE and MAE were below 20
and 15 µg m−3, respectively, in more than 70 % of
the sites due to the correlation coefficient having a seasonal difference
between the meteorological factor and PM2.5. Furthermore, the
hybrid model performed best in winter (mean R2 was 0.84) and worst in
summer (mean R2 was 0.71). The spatiotemporal distribution
characteristics of PM2.5 in China were then estimated and
analyzed. According to the results, there was severe pollution in winter with
an average concentration of PM2.5 being reported as 62.10 µg m−3. However, there was only slight pollution in summer with an average
concentration of PM2.5 being reported as 47.39 µg m−3. The period from 10:00 to 15:00 LT (Beijing time, UTC+8 every day is the best time for
model inversion; at this time the pollution is also high. The findings also
indicate that North China and East China are more polluted than other areas,
and their average annual concentration of PM2.5 was reported as
82.68 µg m−3. Moreover, there was relatively low pollution in
Inner Mongolia, Qinghai, and Tibet, for their average PM2.5
concentrations were reported below 40 µg m−3.