The prediction of air pollutants has always been an issue of great concern to the whole of society. In recent years, the prediction and simulation of air pollutants via machine learning have been widely used. In this study, we collected meteorological data and tropospheric NO2 column concentration data in Beijing, China, between 2012 and 2020, and compared the two methods of time sequence-based and influencing factor-based random forest regression in predicting the tropospheric NO2 column concentration. The results showed that prediction of the tropospheric NO2 column concentration using random forest regression was affected by the changes of human activities, especially emergency events and policy variations. The advantage of time sequence analysis lies in its ability to calculate the distribution of air pollutants with a long-time scale of prediction, but it may produce large errors in numerical value. The advantage of influencing factor prediction lies in its high precision and that it can identify the specific impact of each influencing factor on the NO2 column concentration, but it needs more data and work quantities before it can make a prediction about the future.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.