2022
DOI: 10.37044/osf.io/n7qku
Preprint

Enhancement and Reusage of Biomedical Knowledge Graph Subsets

Abstract: Knowledge Graphs (KGs) such as Wikidata act as hubs of information from multiple domains and disciplines and are crowdsourced by multiple stakeholders. The vast amount of available information makes it difficult for researchers to manage the entire KG, which is also continually being edited. It is necessary to develop tools that extract subsets for domains of interest. These subsets will help researchers reduce costs and time, making data of interest more accessible. In the last two BioHackathons (BH20, BH…

Cited by 1 publication (5 citation statements)
References: 0 publications

“…The RDF serializer in WDumper and WDSub is the same; however, the WDSub filtering system (based on ShEx) can parse quite complex [footnote 25: https://github.com/kg-subsetting/paper-wikidata-subsetting-2023/blob/dc2869c/performance-experiments/count_instances_tsv.py, accessed 10 June 2023] [footnote 26] filters at the SPARQL level, which creates a massive overhead. WDumper also has a better level of multithreading than WDSub.…”
Section: Performance Test Results (mentioning)
confidence: 99%

Wikidata subsetting: Approaches, tools, and evaluation

Hosseini Beghaeiraveri, Labra Gayo, Waagmeester et al. 2023, SW (self-citation)
“…SPARQL endpoints are usually slow and have run-time restrictions. Moreover, recursive data models are not supported in standard SPARQL implementations [20].…”
Section: What Is a Subset? (mentioning)
confidence: 99%