2018
DOI: 10.3897/bdj.6.e27539
|View full text |Cite
|
Sign up to set email alerts
|

Liberating links between datasets using lightweight data publishing: an example using plant names and the taxonomic literature

Abstract: Constructing a biodiversity knowledge graph will require making millions of cross links between diversity entities in different datasets. Researchers trying to bootstrap the growth of the biodiversity knowledge graph by constructing databases of links between these entities lack obvious ways to publish these sets of links. One appealing and lightweight approach is to create a "datasette", a database that is wrapped together with a simple web server that enables users to query the data. Datasettes can be packag… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
3

Relationship

4
2

Authors

Journals

citations
Cited by 7 publications
(9 citation statements)
references
References 24 publications
0
7
0
Order By: Relevance
“…Other efforts to produce domain-specific KGs were published in a variety of areas, including biodiversity (71)(72)(73), the microbiome (74), and for the purpose of enriching clinical data (75,76). The articles on biodiversity focused specifically on how a KG could be created and linked to identifiers in the literature (71,72) or other important biodiversity resources (73).…”
Section: Constructing Knowledge Graphsmentioning
confidence: 99%
See 1 more Smart Citation
“…Other efforts to produce domain-specific KGs were published in a variety of areas, including biodiversity (71)(72)(73), the microbiome (74), and for the purpose of enriching clinical data (75,76). The articles on biodiversity focused specifically on how a KG could be created and linked to identifiers in the literature (71,72) or other important biodiversity resources (73).…”
Section: Constructing Knowledge Graphsmentioning
confidence: 99%
“…Other efforts to produce domain-specific KGs were published in a variety of areas, including biodiversity (71)(72)(73), the microbiome (74), and for the purpose of enriching clinical data (75,76). The articles on biodiversity focused specifically on how a KG could be created and linked to identifiers in the literature (71,72) or other important biodiversity resources (73). In contrast, papers using KGs for clinical enrichment aimed to use KGs as way to link clinical data to sources of evidence to provide support to clinical observations (76)(77)(78) or to help make the data more interpretable with respect to underlying biological mechanism(s) (75) for improved diagnosis (79).…”
Section: Constructing Knowledge Graphsmentioning
confidence: 99%
“…As part of ongoing work to match citation strings for taxonomic names to persistent identifiers for the corresponding publications ( [10]), I developed a set of scripts to matching the text string citations to digital identifiers such as DOIs, Handles, JSTOR links, etc. The difficult part of this work is mapping the page-level citations stored in IPNI to work-level bibliographic data.…”
Section: Publicationsmentioning
confidence: 99%
“…However, as yet we don't have a comprehensive bibliography of life ( [6]), hence much of the work in making the mapping involves scouring the web for sources of bibliographic information in the hope that these will include works containing the IPNI citations. This mapping between IPNI names is periodically uploaded to a GitHub repository 4 , and has also been published as a "datasette" [10]. For this project I took this mapping and exported it in RDF.…”
Section: Publicationsmentioning
confidence: 99%
“…This is just a hint about how exploring Full-size  DOI: 10.7717/peerj-cs.335/fig- 1 the nanopublication relation network could lead to finding related concepts and assertions that might not be explicitly connected in the scientific literature and databases. Nonetheless, despite these premises, nanopublications are not widely used by scientists outside specific circles (Page, 2018); they are hard to find and rarely cited. Nanopublications rarely have a human-readable accessible version and cannot be searched via keywords or natural language queries.…”
Section: Introductionmentioning
confidence: 99%