This document provides background for and summarizes main takeaways of a workshop held virtually to kick off the development of community guidelines for consistently curating and representing dataset quality information in a way that is in line with the FAIR principles.
Knowledge about the quality of data and metadata is important to support informed decisions on the (re)use of individual datasets and is an essential part of the ecosystem that supports open science. Quality assessments reflect the reliability and usability of data. They need to be consistently curated, fully traceable, and adequately documented, as these are crucial for sound decision-and policy-making efforts that rely on data. Quality assessments also need to be consistently represented and readily integrated across systems and tools to allow for improved sharing of information on quality at the dataset level for individual quality attribute or dimension. Although the need for assessing the quality of data and associated information is well recognized, methodologies for an evaluation framework and presentation of resultant quality information to end users may not have been comprehensively addressed within and across disciplines. Global interdisciplinary domain experts have come together to systematically explore needs, challenges and impacts of consistently curating and representing quality information through the entire lifecycle of a dataset. This paper GE PENG
Under the auspices of the Earth Science Information Partners (ESIP) and with collaboration among the ESIP Information Quality Cluster (IQC), the Barcelona Supercomputing Center (BSC) Evaluation and Quality Control (EQC) team, and the Australia/New Zealand Data Quality Interest Group (AU/NZ DQIG), a community effort has been undertaken by international Earth Science domain experts. The objective of this effort is to develop global community guidelines with practical recommendations to promote sharing and reusing of quality information at the dataset level, leveraging the experiences and expertise of a team of interdisciplinary domain experts and community best practices. The community guidelines aim to help stakeholders such as science data centers, repositories, data producers and publishers, data managers and stewards, etc., i) to capture and represent quality information of their datasets in a way that is in line with the FAIR guiding principles; ii) to allow for the maximum trust, sharing, reuse and value of their datasets; and iii) to enable global access to and integration of dataset quality information. The vision of developing these guidelines is to promote the creation and use of freely and openly shared dataset quality information that is consistently described, readily available in community standardized formats, and capable of being integrated across commonly-used Earth science systems and tools for search and access with explicitly expressed usage licenses.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.