2022
DOI: 10.5194/gmd-15-509-2022
|View full text |Cite
|
Sign up to set email alerts
|

An aerosol classification scheme for global simulations using the K-means machine learning method

Abstract: Abstract. The K-means machine learning algorithm is applied to climatological data of seven aerosol properties from a global aerosol simulation using EMAC-MADE3. The aim is to partition the aerosol properties across the global atmosphere in specific aerosol regimes; this is done mainly for evaluation purposes. K-means is an unsupervised machine learning method with the advantage that an a priori definition of the aerosol classes is not required. Using K-means, we are able to quantitatively define global aeroso… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1
1

Relationship

2
6

Authors

Journals

citations
Cited by 11 publications
(7 citation statements)
references
References 88 publications
0
7
0
Order By: Relevance
“…k-means clustering; Hartigan and Wong, 1979) to identify regions dominated by specific INP types or by possible competition mechanisms between different INPs. This technique has recently been demonstrated for the analysis of global aerosol simulations by Li et al (2022). Following the results presented in this study, cirrus cloud and the resulting climate modifications induced by the ice-nucleating particles described here could be analysed in order to provide further insight into these INP-induced climate effects.…”
Section: Discussionmentioning
confidence: 91%
“…k-means clustering; Hartigan and Wong, 1979) to identify regions dominated by specific INP types or by possible competition mechanisms between different INPs. This technique has recently been demonstrated for the analysis of global aerosol simulations by Li et al (2022). Following the results presented in this study, cirrus cloud and the resulting climate modifications induced by the ice-nucleating particles described here could be analysed in order to provide further insight into these INP-induced climate effects.…”
Section: Discussionmentioning
confidence: 91%
“…Kassomenos et al (2010), in their calculation of the optimal cluster count for the K-means method, stress the volatility of this value based on one's choice of evaluation metric as well as on the method, like trial and error with manual verification. The optimal value found in this study, which is calculated based on the L method, is likely to vary if another method is employed, like the combined use of the sum of squared errors (SSE) with the silhouette coefficients, as was done by Li et al (2022). Another important note on the robustness of the clustered results pertains to the strong dependence of the horizontal transport distributions on the atmospheric circulation, i.e., on the wind field and more specifically on the dominant westerlies in both the Northern and Southern Hemisphere.…”
Section: Discussionmentioning
confidence: 99%
“…One of the challenges of applying clustering techniques is determining the optimal number of clusters, which, in the case of QuickBundles, is determined via the user-defined threshold θ . It is typical to combine the evolution of a sum of squared errors (SSE) function with silhouette coefficients to arrive at the most efficient number of clusters, as has been done in a recent aerosol clustering study with a K-means method (Li et al, 2022). This is a grouping technique that seeks to distribute data points across an optimally chosen number of clusters based on the minimization of a withincluster sum of squares (Hartigan and Wong, 1979).…”
Section: Selection Of An Optimal Threshold θmentioning
confidence: 99%
See 1 more Smart Citation
“…The exact code version used to produce the result of this paper is archived at the German Climate Computing Center (DKRZ) and can be made available to members of the MESSy community upon request. The model setup and the simulation data analysed in this work are available at http://doi.org/10.5281/zenodo.6834299(Beer, 2022).https://doi.org/10.5194/acp-2022-529 Preprint. Discussion started: 4 August 2022 c Author(s) 2022.…”
mentioning
confidence: 99%