2020
DOI: 10.1101/2020.07.15.205575
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Highly Accurate Cancer Phenotype Prediction with AKLIMATE, a Stacked Kernel Learner Integrating Multimodal Genomic Data and Pathway Knowledge

Abstract: Advancements in sequencing have led to the proliferation of multi-omic profiles of human cells under different conditions and perturbations. In addition, several databases have amassed information about pathways and gene "signatures" -- patterns of gene expression associated with specific cellular and phenotypic contexts. An important current challenge in systems biology is to leverage such knowledge about gene coordination to maximize the predictive power and generalization of models applied to high-throughp… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 81 publications
0
2
0
Order By: Relevance
“…The approach using predicted functional information proved to be more useful in this context, but more approaches and sources of information can also be incorporated with a focus on prioritizing biologically related genomic regions. Moreover, knowledge from multiple heterogeneous sources can be combined to further pinpoint potential QTLs, termed as poly-omics GP models (Wheeler et al, 2014 ; Uzunangelov et al, 2020 ). These information sources may include (i) predicted variants effects, (ii) gene functions e.g., GO, COEX, (iii) networks of gene-gene and protein-protein interactions, stored in public resources like STRING (Mering et al, 2003 ), GeneMANIA (Warde-Farley et al, 2010 ); (iv) pathways, in which genes are grouped e.g., KEGG (Kanehisa and Goto, 2000 ); (v) previously generated GWAS and QTL results which indicate involvement of particular regions for specific traits e.g., AraGWAS (Togninalli et al, 2020 ), AraQTL (Nijveen et al, 2017 ), (vi) known connections to phenotypes and (vii) endophenotypes, usually measured using -omics data at different stages of genetic information flow toward phenotypes.…”
Section: Discussionmentioning
confidence: 99%
“…The approach using predicted functional information proved to be more useful in this context, but more approaches and sources of information can also be incorporated with a focus on prioritizing biologically related genomic regions. Moreover, knowledge from multiple heterogeneous sources can be combined to further pinpoint potential QTLs, termed as poly-omics GP models (Wheeler et al, 2014 ; Uzunangelov et al, 2020 ). These information sources may include (i) predicted variants effects, (ii) gene functions e.g., GO, COEX, (iii) networks of gene-gene and protein-protein interactions, stored in public resources like STRING (Mering et al, 2003 ), GeneMANIA (Warde-Farley et al, 2010 ); (iv) pathways, in which genes are grouped e.g., KEGG (Kanehisa and Goto, 2000 ); (v) previously generated GWAS and QTL results which indicate involvement of particular regions for specific traits e.g., AraGWAS (Togninalli et al, 2020 ), AraQTL (Nijveen et al, 2017 ), (vi) known connections to phenotypes and (vii) endophenotypes, usually measured using -omics data at different stages of genetic information flow toward phenotypes.…”
Section: Discussionmentioning
confidence: 99%
“…The approach using predicted functional information proved to be more useful in this context, but more approaches and sources of information can also be incorporated with a focus on prioritizing biologically related genomic regions. Moreover, knowledge from multiple heterogeneous sources can be combined to further pinpoint potential QTLs, termed as poly-omics GP models (Wheeler, Aquino-Michaels et al 2014, Uzunangelov, Wong et al 2020). These information sources may include (i) predicted variants effects, (ii) gene functions e.g.…”
Section: Exploiting Biological Knowledge To Improve Genomic Predictionmentioning
confidence: 99%