2022
DOI: 10.1162/dint_a_00131
|View full text |Cite
|
Sign up to set email alerts
|

HPC-oriented Canonical Workflows for Machine Learning Applications in Climate and Weather Prediction

Abstract: Machine learning (ML) applications in weather and climate are gaining momentum as big data and the immense increase in High-performance computing (HPC) power are paving the way. Ensuring FAIR data and reproducible ML practices are significant challenges for Earth system researchers. Even though the FAIR principle is well known to many scientists, research communities are slow to adopt them. Canonical Workflow Framework for Research (CWFR) provides a platform to ensure the FAIRness and reproducibility of these … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 30 publications
0
2
0
Order By: Relevance
“…In order to address the challenges with respect to data mentioned above, current efforts at DKRZ (German Climate Computing Center) are aimed at a complete restructuring of the way research is performed in simulation-based climate research (Anders et al 2022, Mozaffari et al 2022, Weigel et al 2020. DKRZ is perfectly suited for this endeavor, because researchers have the resources and services available to conduct the entire suite of their data-intensive workflows -ranging from planning and setting up of model simulations, analyzing the model output, reusing existing large-volume datasets to data publication and long-term archival.…”
Section: Cultural Changementioning
confidence: 99%
“…In order to address the challenges with respect to data mentioned above, current efforts at DKRZ (German Climate Computing Center) are aimed at a complete restructuring of the way research is performed in simulation-based climate research (Anders et al 2022, Mozaffari et al 2022, Weigel et al 2020. DKRZ is perfectly suited for this endeavor, because researchers have the resources and services available to conduct the entire suite of their data-intensive workflows -ranging from planning and setting up of model simulations, analyzing the model output, reusing existing large-volume datasets to data publication and long-term archival.…”
Section: Cultural Changementioning
confidence: 99%
“…At the beginning, all data mining samples are in the root node, and using a recursive approach, the samples are divided by specified attributes, and a corresponding number of branches are developed based on the attribute value, which must be discrete, or if the attribute value is continuous, it must first be discretized by processing this attribute value to make it discrete [11][12]. In the second stage of decision tree pruning, it is to cut out the isolated points and noise generated in the training data, so that the constructed decision tree can be compared one by one with each attribute value of the sample data, so as to achieve the purpose of classifying the unknown sample data and mining it [13].…”
Section: Decision Treesmentioning
confidence: 99%