2022
DOI: 10.1093/bioinformatics/btac735
|View full text |Cite
|
Sign up to set email alerts
|

Gene set proximity analysis: expanding gene set enrichment analysis through learned geometric embeddings, with drug-repurposing applications in COVID-19

Abstract: Motivation Gene set analysis methods rely on knowledge-based representations of genetic interactions in the form of both gene set collections and protein-protein interaction (PPI) networks. However, explicit representations of genetic interactions often fail to capture complex interdependencies among genes, limiting the analytic power of such methods. Results We propose an extension of gene set enrichment analysis to a latent… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 8 publications
(9 citation statements)
references
References 49 publications
0
9
0
Order By: Relevance
“…In 22 out of 31 cases (71% datasets), ANDES set enrichment outperforms the hypergeometric test for KEGG pathway identification (p<7e-4, Wilcoxon rank sum test, Figure 3A). For the rank-based version, we compare ANDES with GSEA [27] as well as a new embedding-based gene set enrichment method, GSPA [29]. Aggregating performance across all 42 datasets in the benchmark, ANDES’ rank-based gene set enrichment significantly improves over both GSEA (p=0.041; Wilcoxon rank sum test) and GSPA (p=0.028; Wilcoxon rank sum test) (Figure 3B).…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…In 22 out of 31 cases (71% datasets), ANDES set enrichment outperforms the hypergeometric test for KEGG pathway identification (p<7e-4, Wilcoxon rank sum test, Figure 3A). For the rank-based version, we compare ANDES with GSEA [27] as well as a new embedding-based gene set enrichment method, GSPA [29]. Aggregating performance across all 42 datasets in the benchmark, ANDES’ rank-based gene set enrichment significantly improves over both GSEA (p=0.041; Wilcoxon rank sum test) and GSPA (p=0.028; Wilcoxon rank sum test) (Figure 3B).…”
Section: Resultsmentioning
confidence: 99%
“… (A) Performance comparison between ANDES and hypergeometric test in retrieving annotated KEGG terms using genes that have FDR ≤ 0.05 in each dataset (where there are at least 10 genes that are significantly differentially expressed). (B) Performance comparison between ANDES, GSEA [27], and GSPA [29] in retrieving annotated KEGG terms using the full list of genes (no FDR cutoff), ranked by log 2 (fold change). In both cases, ANDES statistically outperforms other methods, demonstrating the advantage of incorporating gene embedding information using best-match into the gene set enrichment setting.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…While such leverages of network topology in annotation enrichment analyses have been used before, [5]–[7], few approaches have used networks to disentangle convoluted cellular mechanisms in annotation analyses. If the studied gene set involves genes from two or more moderately independent cellular processes, the signal is obscured, and false negative identifications will be more likely.…”
Section: Methodsmentioning
confidence: 99%