2015
DOI: 10.1002/0471250953.bi0215s51
|View full text |Cite
|
Sign up to set email alerts
|

DNA Motif Databases and Their Uses

Abstract: Transcription factors (TFs) recognize and bind to specific DNA sequences. The specificity of a TF is usually represented as a position weight matrix (PWM). Several databases of DNA motifs exist and are used in biological research to address important biological questions. This overview describes PWMs and some of the most commonly used motif databases, as well as a few of their common applications. © 2015 by John Wiley & Sons, Inc.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 20 publications
(20 citation statements)
references
References 18 publications
0
20
0
Order By: Relevance
“…Though ChIP can experimentally identify TF-bound regions, existing data sets in the Drosophila embryo are low resolution, with ~100 base pair peaks (Li et al, 2008; MacArthur et al, 2009; Roy et al, 2010), which are longer than the ~10 bp TF binding sites (Zhu et al, 2011). Therefore, we predict TF binding sites using experimentally measured binding motifs (Stormo, 2015). To select a threshold above which a sequence is deemed a “true” binding site, we develop a principled approach, scoring the aligned sequences used to create the motifs and setting a threshold such that 75% of these aligned sequences are predicted as “true” (see Materials and Methods).…”
Section: Resultsmentioning
confidence: 99%
“…Though ChIP can experimentally identify TF-bound regions, existing data sets in the Drosophila embryo are low resolution, with ~100 base pair peaks (Li et al, 2008; MacArthur et al, 2009; Roy et al, 2010), which are longer than the ~10 bp TF binding sites (Zhu et al, 2011). Therefore, we predict TF binding sites using experimentally measured binding motifs (Stormo, 2015). To select a threshold above which a sequence is deemed a “true” binding site, we develop a principled approach, scoring the aligned sequences used to create the motifs and setting a threshold such that 75% of these aligned sequences are predicted as “true” (see Materials and Methods).…”
Section: Resultsmentioning
confidence: 99%
“…To identify the local sequence and structure properties that determine specificity, we fit the entire distribution of K A,rel values to two analytical models of RNA sequence specificity. First, a position weight matrix (PWM) model was used that only considers the identity of a nucleobase at each position in the loop (28). The affinity distribution was also fit to a PWM model that includes interaction coefficients (IC values) between nucleobases that account for positive and negative effects of sequence variation between positions in the binding site (29,30).…”
Section: Ess3mentioning
confidence: 99%
“…The precise location of binding sites could be predicted by computational methods based on binding motif models such as positional weight matrices (PWMs). Direct information on binding specificities is known for about 30–50% of ∼1400 human TFs ( 8 ), and is stored in many proprietary and open access collections such as JASPAR ( 9 ), TRANSFAC ( 10 ), SwissRegulon ( 11 ), HOCOMOCO ( 12 ) and others ( 13 ). From the beginning, the idea of HOCOMOCO was to provide a single binding model for each transcription factor, except for TFs exhibiting two distinctly different and well-confirmed binding specificities.…”
Section: Introductionmentioning
confidence: 99%