2021
DOI: 10.1371/journal.pone.0246099
|View full text |Cite
|
Sign up to set email alerts
|

Dataset search in biodiversity research: Do metadata in data repositories reflect scholarly information needs?

Abstract: The increasing amount of publicly available research data provides the opportunity to link and integrate data in order to create and prove novel hypotheses, to repeat experiments or to compare recent data to data collected at a different time or place. However, recent studies have shown that retrieving relevant data for data reuse is a time-consuming task in daily research practice. In this study, we explore what hampers dataset retrieval in biodiversity research, a field that produces a large amount of hetero… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
27
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 30 publications
(28 citation statements)
references
References 49 publications
0
27
0
1
Order By: Relevance
“…For brevity, we assume that datasets of interest have already been identified. However, searchability of data repositories is an active area of research (Pampel et al, 2013;Löffler et al, 2021). We briefly review several common approaches to data acquisition, harmonization, curation, and publication.…”
Section: Current Database Pipelinementioning
confidence: 99%
See 1 more Smart Citation
“…For brevity, we assume that datasets of interest have already been identified. However, searchability of data repositories is an active area of research (Pampel et al, 2013;Löffler et al, 2021). We briefly review several common approaches to data acquisition, harmonization, curation, and publication.…”
Section: Current Database Pipelinementioning
confidence: 99%
“…Soil data uses include a broad range of applications such as ecology, biogeochemistry (Iversen et al, 2017;Wieder et al, 2021b), soil engineering, soil taxonomy and classification, geochemistry (Nave et al, 2016;Hengl et al, 2017;Lawrence et al, 2020), micrometeorology (Cheah et al, 2018), agronomy (Lyons et al, 2020), and geomorphology. Datasets, defined as "a collection of scientific data including primary data and metadata organized and formatted for a particular purpose" (Löffler et al, 2021), are assembled by an equally diverse range of organizations. These organizations include government agencies, academic collaborations, nongovernmental organizations, and industry, reflecting a wide range of generators and users including farmers, land managers, students, technicians, scientists, and policy makers.…”
Section: Introductionmentioning
confidence: 99%
“…Most of this work is so far focused on improving the FAIRness of biodiversity data. It includes work on improvement of discoverability of data by better, semantic descriptions (Löffler et al 2021, Pfaff et al 2017). These investigations have shown which categories of concepts (e.g., organism, environment, process, event) are relevant to biodiversity research.…”
Section: Preliminary Work: Biodiversity Informatics and Semantic Webmentioning
confidence: 99%
“…However, practical guidance on semantic annotation of biodiversity literature are few and far between and usually refer to English-language text corpora with a focus on taxonomy (see, e.g., Sautter et al, 2007). Beyond mere taxonomic tagging, more recent workflows also cover a much broader thematic range of biodiversity entities, but do not allow multilabel annotation, that is, (possibly) assigning more than one annotation tag to an annotation unit (Löffler et al, 2020;Nguyen et al, 2019;Thessen et al, 2018). Enhancing content retrieval and information fusion by multi-label annotation has since found its way into the biomedical domain, e.g.…”
Section: Introductionmentioning
confidence: 99%