2022
DOI: 10.1101/2022.05.11.491490
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Integrating and formatting biomedical data as pre-calculated knowledge graph embeddings in the Bioteque

Abstract: Biomedical data is accumulating at a fast pace and integrating it into a unified framework is a major challenge, so that multiple views of a given biological event can be considered simultaneously. Here we present the Bioteque, a resource of unprecedented size and scope that contains pre-calculated biomedical embeddings derived from a gigantic knowledge graph, displaying more than 450 thousand biological entities and 30 million relationships between them. The Bioteque integrates, harmonizes and formats data co… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(4 citation statements)
references
References 88 publications
(121 reference statements)
0
4
0
Order By: Relevance
“…Thus, RD diagnosis can become a “needle in a haystack” task where the solution is still far connected to current knowledge. In consequence, to increase their efficiency in difficult cases, gene-disease prediction methods are encouraged to use new and various types of functional annotations and combine them accurately (15, 16, 61, 62). In this study, we first confirmed that different types of gene-gene functional association networks had different capabilities in recovering known genes associated with a large collection of RDs.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…Thus, RD diagnosis can become a “needle in a haystack” task where the solution is still far connected to current knowledge. In consequence, to increase their efficiency in difficult cases, gene-disease prediction methods are encouraged to use new and various types of functional annotations and combine them accurately (15, 16, 61, 62). In this study, we first confirmed that different types of gene-gene functional association networks had different capabilities in recovering known genes associated with a large collection of RDs.…”
Section: Discussionmentioning
confidence: 99%
“…Gene annotations not represented as gene or protein relationships were transformed into gene-gene co-annotation networks, describing genes (nodes) and their functional relationship (edges). Thus, for networks describing human gene-phenotype similarity using Human Phenotype Ontology (HPO) terms (32) and phenotype similarity using mouse orthologs from the Mouse Genome Informatics (MGI) (16) we calculated Jaccard similarity for each pair of genes sharing at least one HPO term and constructed a null distribution of Jaccard values to compute z-scores. Significant gene interactions (z-score>1.96; p-values<0.05) were selected to generate the network of phenotypes.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations