Moritz Lange scite author profile

Abstract. Running large-eddy simulations (LES) can be burdensome and computationally too expensive from the application point of view, for example to support urban planning. In this study, regression models are used to replicate modelled air pollutant concentrations from LES in urban boulevards. We study the performance of regression models and discuss how to detect situations where the models are applied outside their training domain and their outputs cannot be trusted. Regression models from 10 different model families are trained, and a cross-validation methodology is used to evaluate their performance and to find the best set of features needed to reproduce the LES outputs. We also test the regression models on an independent testing dataset. Our results suggest that, in general, log-linear regression gives the best and most robust performance on new independent data. It clearly outperforms the dummy model, which would predict constant concentrations for all locations (mRMSE of 0.76 vs. 1.78 for the dummy model). Furthermore, we demonstrate that it is possible to detect concept drift, i.e., situations where the model is applied outside its training domain and a new LES run may be necessary to obtain reliable results. Regression models can be used to replace LES simulations in estimating air pollutant concentrations, unless higher accuracy is needed. To obtain reliable results, however, it is important to do the model and feature selection carefully to avoid over-fitting and to use methods to detect concept drift.


Machine-learning models to replicate large-eddy simulations of air pollutant concentrations along boulevard-type streets

Lange

Suominen

Kurppa

et al. 2021

Geosci. Model Dev.


Abstract. Running large-eddy simulations (LESs) can be burdensome and computationally too expensive from the application point of view, for example, to support urban planning. In this study, regression models are used to replicate modelled air pollutant concentrations from LES in urban boulevards. We study the performance of regression models and discuss how to detect situations where the models are applied outside their training domain and their outputs cannot be trusted. Regression models from 10 different model families are trained and a cross-validation methodology is used to evaluate their performance and to find the best set of features needed to reproduce the LES outputs. We also test the regression models on an independent testing dataset. Our results suggest that in general, log-linear regression gives the best and most robust performance on new independent data. It clearly outperforms the dummy model which would predict constant concentrations for all locations (multiplicative minimum RMSE (mRMSE) of 0.76 vs. 1.78 of the dummy model). Furthermore, we demonstrate that it is possible to detect concept drift, i.e. situations where the model is applied outside its training domain and a new LES run may be necessary to obtain reliable results. Regression models can be used to replace LES simulations in estimating air pollutant concentrations, unless higher accuracy is needed. In order to have reliable results, it is however important to do the model and feature selection carefully to avoid overfitting and to use methods to detect the concept drift.
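The comparison the abstract describes, a log-linear regression evaluated against a constant "dummy" predictor under cross-validation, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the data are synthetic rather than LES output, the three features and the helper names (`fit_log_linear`, `log_rmse`, `cross_validate`) are hypothetical, and the error is a plain RMSE on log-concentrations, not the paper's mRMSE, whose definition is not given in the abstract.

```python
import numpy as np


def fit_log_linear(X, y):
    """Least-squares fit of log(y) = b0 + X @ b (hypothetical helper)."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, np.log(y), rcond=None)
    return coef


def predict_log_linear(coef, X):
    """Map the linear prediction back to concentration space."""
    A = np.column_stack([np.ones(len(X)), X])
    return np.exp(A @ coef)


def log_rmse(y_true, y_pred):
    """RMSE on log-concentrations (an assumed stand-in, not the paper's mRMSE)."""
    return float(np.sqrt(np.mean((np.log(y_true) - np.log(y_pred)) ** 2)))


def cross_validate(X, y, k=5, seed=0):
    """k-fold CV returning mean error of the log-linear model and of the dummy."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), k)
    model_err, dummy_err = [], []
    for i in range(k):
        test = folds[i]
        train = np.concatenate([folds[j] for j in range(k) if j != i])
        coef = fit_log_linear(X[train], y[train])
        model_err.append(log_rmse(y[test], predict_log_linear(coef, X[test])))
        # Dummy model: one constant (geometric mean of training data) everywhere.
        const = np.exp(np.mean(np.log(y[train])))
        dummy_err.append(log_rmse(y[test], np.full(len(test), const)))
    return float(np.mean(model_err)), float(np.mean(dummy_err))


# Synthetic, strictly positive "concentrations" that are log-linear in 3 features.
rng = np.random.default_rng(42)
X = rng.uniform(0.0, 1.0, size=(200, 3))
y = np.exp(0.5 + X @ np.array([1.5, -2.0, 0.7]) + rng.normal(0.0, 0.1, 200))

model_rmse, dummy_rmse = cross_validate(X, y)
print(f"log-linear: {model_rmse:.3f}  dummy: {dummy_rmse:.3f}")
```

Because the synthetic data are generated from a log-linear relationship, the regression is expected to beat the constant predictor here by construction; the sketch shows only the evaluation pattern, not the paper's results.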


Modern Build Automation for an Insurance Company Tool Selection

Lange

Tran

Grunewald

et al. 2023

Procedia Computer Science


BImSchG

Altenschmidt

Appel

Becher

et al. 2021


The Berliner Kommentar BImSchG explains, comprehensively and with a practical focus, all provisions of the BImSchG (the German Federal Immission Control Act), including the most important rules from the sprawling body of subordinate regulations. Everything is conveniently gathered in one handy volume, with precise, clearly understandable commentary that takes the relevant case law into account. Get the overview!



Moritz Lange

§ 6 Die Rolle der nationalen Gerichte im Europarecht

Machine learning models to replicate large-eddy simulations of air pollutant concentrations along boulevard-type streets
