2022
DOI: 10.1093/bioinformatics/btac259
|View full text |Cite
|
Sign up to set email alerts
|

PhyloPGM: boosting regulatory function prediction accuracy using evolutionary information

Abstract: Motivation The computational prediction of regulatory function associated with a genomic sequence is of utter importance in -omics study, which facilitates our understanding of the underlying mechanisms underpinning the vast gene regulatory network. Prominent examples in this area include the binding prediction of transcription factors in DNA regulatory regions, and predicting RNA–protein interaction in the context of post-transcriptional gene expression. However, existing computational metho… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(3 citation statements)
references
References 56 publications
0
3
0
Order By: Relevance
“…We evaluated Graphylo against several state-of-the-art machine learning models that operate on single-species data: (1) FactorNet, 16 a top-performing single-species TFBS predictor in a recent DREAM competition 17 but limited to using only sequence information; (2) FactorNet+phastCons, a modified version of FactorNet that uses as input both the human sequence of interest and its interspecies conservation level (average PhastCons score); (3) RNATracker, 41 a tool initially designed for RNA subcellular localization analysis but also effective at TFBS prediction; and (4) PhyloPGM applied to FactorNet prediction values, 39 a probabilistic graphical model that combines predictions made on orthologous and ancestral sequences to improve a base FactorNet’s accuracy. Performance was measured using area under the receiver operating characteristic curve (AUROC) for the RBP binding task and the area under the precision-recall curve (AUPR) for the TF binding task on a held-out test set that were not used for training.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…We evaluated Graphylo against several state-of-the-art machine learning models that operate on single-species data: (1) FactorNet, 16 a top-performing single-species TFBS predictor in a recent DREAM competition 17 but limited to using only sequence information; (2) FactorNet+phastCons, a modified version of FactorNet that uses as input both the human sequence of interest and its interspecies conservation level (average PhastCons score); (3) RNATracker, 41 a tool initially designed for RNA subcellular localization analysis but also effective at TFBS prediction; and (4) PhyloPGM applied to FactorNet prediction values, 39 a probabilistic graphical model that combines predictions made on orthologous and ancestral sequences to improve a base FactorNet’s accuracy. Performance was measured using area under the receiver operating characteristic curve (AUROC) for the RBP binding task and the area under the precision-recall curve (AUPR) for the TF binding task on a held-out test set that were not used for training.…”
Section: Resultsmentioning
confidence: 99%
“…Comparing the performance of these models to Graphylo is a good measurement of the advantages Graphylo is taking from evolutionary information. We also compared Graphylo against PhyloPGM 39 which is a probabilistic graphical model that successfully incorporates predictions from different nodes in the phylogenetic tree.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation