2021
DOI: 10.5334/dsj-2021-015
|View full text |Cite
|
Sign up to set email alerts
|

Implementing a Registry Federation for Materials Science Data Discovery

Abstract: As a result of a number of national initiatives, we are seeing rapid growth in the data important to materials science that are available over the web. Consequently, it is becoming increasingly difficult for researchers to learn what data are available and how to access them. To address this problem, the Research Data Alliance (RDA) Working Group for International Materials Science Registries (IMRR) was established to bring together materials science and information technology experts to develop an internation… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
12
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(12 citation statements)
references
References 6 publications
0
12
0
Order By: Relevance
“…We are further developing the registry software to take advantage of metadata extensions and make it more robust to an evolving metadata schema. This is described in more detail in the related Materials Resource Registry architecture paper (Plante et al 2021).…”
Section: The Resource Metadatamentioning
confidence: 99%
See 2 more Smart Citations
“…We are further developing the registry software to take advantage of metadata extensions and make it more robust to an evolving metadata schema. This is described in more detail in the related Materials Resource Registry architecture paper (Plante et al 2021).…”
Section: The Resource Metadatamentioning
confidence: 99%
“…Using a controlled vocabulary provides a number of advantages that simplifies creating records and searching for records. This vocabulary was not meant to exhaustively cover all domains of materials science at all levels; rather, it was intended to assist with discovering high level data resources described in the registry (Plante et al 2021); consequently, it focuses on attributes of data and data service collections rather than individual datasets.…”
Section: The Materials Science Vocabularymentioning
confidence: 99%
See 1 more Smart Citation
“…To make Common Fund data more findable, the CFDE has created a flexible system of data federation that enables users to discover datasets from across the CF at a centralized portal without requiring Common Fund programs to move, reformat, or rehost their data, similar to the federation strategy of the Research Data Alliance (Plante et al, 2021) and The Australian Research Data Commons (Barker, Wilkinson and Treloar, 2019). The CFDE uses a sociotechnical federation system that combines proven, explicitly community driven approaches (Cruz et al, 2019;DeBarry et al, 2020;Plante et al, 2021) with a model driven catalog that incorporates metadata submitted by individual CF Program Data Coordination Centers (DCCs) into a uniform metadata model that can then be indexed and searched from a centralized portal. This uniform Crosscut Metadata Model (C2M2), supports the wide variety of dataset types, vocabularies, and metadata terms used by the individual CF DCCs.…”
Section: Introductionmentioning
confidence: 99%
“…Reusing Common Fund data for new cross-cutting analyses requires expertise in working with large files, accessing data in the cloud, harmonization, and data transformation --all before any scientific analysis can begin. Each stage presents an individually large challenge for a typical biomedical researcher or clinician which motivates labs to hire dedicated bioinformaticians (at considerable cost to NIH); confronting all of these challenges together is prohibitive for integrative analysis.To make Common Fund data more findable, the CFDE has created a flexible system of data federation that enables users to discover datasets from across the CF at a centralized portal without requiring Common Fund programs to move, reformat, or rehost their data, similar to the federation strategy of the Research Data Alliance (Plante et al, 2021) and The Australian Research Data Commons (Barker, Wilkinson and Treloar, 2019). The CFDE uses a sociotechnical federation system that combines proven, explicitly community driven approaches (Cruz et al, 2019;DeBarry et al, 2020;Plante et al, 2021) with a model driven catalog that incorporates metadata submitted by individual CF Program Data Coordination Centers (DCCs) into a uniform metadata model that can then be indexed and searched from a centralized portal.…”
mentioning
confidence: 99%