2000
DOI: 10.1089/106652700750050826
|View full text |Cite
|
Sign up to set email alerts
|

Algorithms for Extracting Structured Motifs Using a Suffix Tree with an Application to Promoter and Regulatory Site Consensus Identification

Abstract: This paper introduces two exact algorithms for extracting conserved structured motifs from a set of DNA sequences. Structured motifs may be described as an ordered collection of p > or = 1 "boxes" (each box corresponding to one part of the structured motif), p substitution rates (one for each box) and p - 1 intervals of distance (one for each pair of successive boxes in the collection). The contents of the boxes--that is, the motifs themselves--are unknown at the start of the algorithm. This is precisely what … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

2
172
1
7

Year Published

2003
2003
2019
2019

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 207 publications
(182 citation statements)
references
References 28 publications
(31 reference statements)
2
172
1
7
Order By: Relevance
“…Algorithms for structured motifs extraction [6] address the extraction of consensus motifs that appear together in a well-ordered and regularly spaced manner. A structured motif can be described as an ordered collection of p ≥ 1 boxes, a maximum allowed error (substitutions) for each box, and an interval of distance for each pair of consecutive boxes.…”
Section: Preliminariesmentioning
confidence: 99%
See 3 more Smart Citations
“…Algorithms for structured motifs extraction [6] address the extraction of consensus motifs that appear together in a well-ordered and regularly spaced manner. A structured motif can be described as an ordered collection of p ≥ 1 boxes, a maximum allowed error (substitutions) for each box, and an interval of distance for each pair of consecutive boxes.…”
Section: Preliminariesmentioning
confidence: 99%
“…No constraint, and therefore no statistical value is put on the distances separating them. This paper is based on the only previous algorithm that is able to identify motifs composed of any number of boxes -structured motifs [6]. There are two central problems concerning motifs in sequences: localization and extraction [13].…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…The efficiency of the filter relies on an original data structure, the bi-factor array, that is also introduced in this paper, and on a labelling of the seeds similar to the one employed in [8]. This new data structure can be used to speed up other tasks such as the inference of structured motifs [18] or for improving other filters [14].…”
Section: Introductionmentioning
confidence: 99%