2022
DOI: 10.1186/s12859-022-04821-9
|View full text |Cite
|
Sign up to set email alerts
|

StackEPI: identification of cell line-specific enhancer–promoter interactions based on stacking ensemble learning

Abstract: Background Understanding the regulatory role of enhancer–promoter interactions (EPIs) on specific gene expression in cells contributes to the understanding of gene regulation, cell differentiation, etc., and its identification has been a challenging task. On the one hand, using traditional wet experimental methods to identify EPIs often means a lot of human labor and time costs. On the other hand, although the currently proposed computational methods have good recognition effects, they generall… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 46 publications
0
5
0
Order By: Relevance
“…They first train a model to discriminate true vs. false E/G based on distinctive features from the 1D data using a reference dataset of known E/G (most often determined using a combination of 1D data for enhancer and promoter identification and 3D or genetic data for the relationship identification) and a dataset of unsupported E/G as a negative control. When provided with a new relationship associated to 1D data attributes, the model is used to decide whether the relationship is more likely to be a true E/G than a false E/G ([29, 2, 14, 30, 36, 6, 37, 13, 20, 4, 15, 11]) (Figure 3).…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…They first train a model to discriminate true vs. false E/G based on distinctive features from the 1D data using a reference dataset of known E/G (most often determined using a combination of 1D data for enhancer and promoter identification and 3D or genetic data for the relationship identification) and a dataset of unsupported E/G as a negative control. When provided with a new relationship associated to 1D data attributes, the model is used to decide whether the relationship is more likely to be a true E/G than a false E/G ([29, 2, 14, 30, 36, 6, 37, 13, 20, 4, 15, 11]) (Figure 3).…”
Section: Resultsmentioning
confidence: 99%
“…Accordingly we also had to modify the sheffield.correlation.py script: 1) we added an all equal function to prevent divisions by zero in the Calculate Correlation function and 2) we added code to create the gene summary file that the script was supposed to take as input. The complete process to run the Sheffield method on the BENGI set can be found on this page 11 Finally we ran the Run-Average-Rank.sh script that evaluates the Average-Rank method on a BENGI set. This script takes as input a string defining the BENGI set and the version of the BENGI set, and outputs a 7 column tabulated file including for each E/G of the BENGI set, 1 or 0 according to whether this E/G is true or false in the BENGI set, the average rank score, the distance score, the correlation score, the distance rank, the correlation rank and the average rank between the distance and the correlation.…”
Section: Methods Evaluation On the Bengi Setsmentioning
confidence: 99%
See 1 more Smart Citation
“…The performance of the model as a whole is enhanced by the output module’s subsequent use of a CNN and dense layer combination to further enhance these important properties. Recently, Fan and Peng [ 27 ] introduced a technique known as StackEPI, which merges several feature representations and classical machine learning algorithms, employs a stacking ensemble approach, and performs the prediction process solely based on promoter and enhancer gene sequences.…”
Section: Introductionmentioning
confidence: 99%
“…The regulation of transcription by enhancers has been studied since the 1980s [ 43 ]. Enhancers in genetics are considered short (50–1500 base pairs) DNA regions to which proteins (activators) bind to increase the likelihood of transcription of a particular gene [ 44 , 45 ]. Often, the proteins that bind to these enhancers are called transcription factors.…”
Section: Introductionmentioning
confidence: 99%