2020
DOI: 10.7554/elife.52614
|View full text |Cite
|
Sign up to set email alerts
|

Wikidata as a knowledge graph for the life sciences

Abstract: Wikidata is a community-maintained knowledge base that has been assembled from repositories in the fields of genomics, proteomics, genetic variants, pathways, chemical compounds, and diseases, and that adheres to the FAIR principles of findability, accessibility, interoperability and reusability. Here we describe the breadth and depth of the biomedical knowledge contained within Wikidata, and discuss the open-source tools we have built to add information to Wikidata and to synchronize it with source databases.… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
108
0
1

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 110 publications
(119 citation statements)
references
References 67 publications
0
108
0
1
Order By: Relevance
“…Wikidata (https://wikidata.org) is a general knowledge base which contains an array of biomedical data sources that have recently been reported (Waagmeester et al, 2020). In contrast to curated knowledge bases such as EpiGraphDB, Wikidata is developed through community driven efforts and bot automation, and incorporates extensive knowledge across a wide array of fields, including (but not limited to) a range of biomedical entities, with duplication and redundancy of entities inevitable.…”
Section: Discussionmentioning
confidence: 99%
“…Wikidata (https://wikidata.org) is a general knowledge base which contains an array of biomedical data sources that have recently been reported (Waagmeester et al, 2020). In contrast to curated knowledge bases such as EpiGraphDB, Wikidata is developed through community driven efforts and bot automation, and incorporates extensive knowledge across a wide array of fields, including (but not limited to) a range of biomedical entities, with duplication and redundancy of entities inevitable.…”
Section: Discussionmentioning
confidence: 99%
“…{ "_id": "43740571" , "_score": 15.594226 , "accession": { "genomic": [ "MN908947.3" , "NC_045512.2" ], "protein": [ "QHD43419.1" , "YP_009724393.1" ] }, "entrezgene": "43740571" , "locus_tag": "GU280_gp05" , "name": "membrane glycoprotein" , "other_names": "membrane glycoprotein" , "refseq": { "genomic": "NC_045512.2" , "protein": "YP_009724393.1" }, "retired": 43560233 , "symbol": "M" , "taxid": 2697049 , "type_of_gene": "protein-coding" } The COVID-19 related pathways from WikiPathways COVID-19 Portal are added to Wikidata using the approach previously described (10) . For this, a dedicated repository has been set up to hold the GPML files, the internal WikiPathways file format, The GPML is converted into RDF files with the WikiPathways RDF generator (36) , while the files with author information are manually edited.…”
Section: Mygeneinfomentioning
confidence: 99%
“…The Gene Wiki project has been tearing down the different research silos on genetics, biological processes, related diseases and associated drugs (10) . In contrast to legacy databases, where data models follow a relational data schema of connected tables, Wikidata ( https://wikidata.org/ ) uses statements to store facts (see Figure 1) (10)(11)(12)(13) .…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…There are numerous studies in the literature that aimed to integrate the available biomedical data [1][2][3][4][5][6][7][8][9][10] . These studies provided useful tools and methods to the life-sciences research community; however, many of them miss important functionalities that prevent them from becoming widely adopted tools/services (Supplementary Information section 1).…”
Section: Mainmentioning
confidence: 99%