2022
DOI: 10.1093/bib/bbac425
|View full text |Cite
|
Sign up to set email alerts
|

PlantBind: an attention-based multi-label neural network for predicting plant transcription factor binding sites

Abstract: Identification of transcription factor binding sites (TFBSs) is essential to understanding of gene regulation. Designing computational models for accurate prediction of TFBSs is crucial because it is not feasible to experimentally assay all transcription factors (TFs) in all sequenced eukaryotic genomes. Although many methods have been proposed for the identification of TFBSs in humans, methods designed for plants are comparatively underdeveloped. Here, we present PlantBind, a method for integrated prediction … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(9 citation statements)
references
References 46 publications
0
9
0
Order By: Relevance
“…Transcription factors from the bHLH family are known to bind both E‐boxes and N‐boxes (Li et al ., 2006), but little is known regarding the nucleotide preference of each protein, with new softwares emerging to predict TF targets (e.g. Yan et al ., 2022; Cheng et al ., 2023). The S. viridis PEPC1 promoter sequence contains seven different E‐Boxes and two different N‐Boxes variants, making a total of 17 putative bHLH binding sites (Fig.…”
Section: Resultsmentioning
confidence: 99%
“…Transcription factors from the bHLH family are known to bind both E‐boxes and N‐boxes (Li et al ., 2006), but little is known regarding the nucleotide preference of each protein, with new softwares emerging to predict TF targets (e.g. Yan et al ., 2022; Cheng et al ., 2023). The S. viridis PEPC1 promoter sequence contains seven different E‐Boxes and two different N‐Boxes variants, making a total of 17 putative bHLH binding sites (Fig.…”
Section: Resultsmentioning
confidence: 99%
“…Additionally, we used a large volume of DAP-seq data for Arabidopsis thaliana to fairly evaluate different software for the identification of TFBS in plants. This DAP-seq data was previously utilized by tools like TSPTFBS [51], PlantBind [53], and TSPTFBS 2.0 [63]. This dataset covered the peak data for 35 plant transcription factor families, as shown in Figure 3 with (a) a line plot and (b) a pie chart.…”
Section: Methodsmentioning
confidence: 99%
“…Many TFs share a similar binding motif but yet differ in their binding due to local surroundings and contexts ( Figure 1a & b ). Factors like DNA shape also play an important role in determining the binding sites for the TFs as shown very recently by some, including couple of software implementing shape information and observing performance improvement ( Mejía-Guerra and Buckler 2019; Ji et al 2021; Yan et al 2022; Sielemann et al 2021 ). Further to this, the choice of negative datasets with most of the software has been very relaxed as they randomly pick sequences and give too much weight to the consensus motif which can actually occur even in the non binding regions, creating weak datasets on which learnings have been done so far.…”
Section: Introductionmentioning
confidence: 99%