2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2016
DOI: 10.1109/bibm.2016.7822731
|View full text |Cite
|
Sign up to set email alerts
|

A comparison of seed-and-extend techniques in modern DNA read alignment algorithms

Abstract: Abstract-DNA read alignment is a major step in genome analysis. However, as DNA reads continue to become longer, new approaches need to be developed to effectively use these longer reads in the alignment process. Modern aligners commonly use a two-step approach for read alignment: 1. seeding, 2. extension. In this paper, we have investigated various seeding and extension techniques used in modern DNA read alignment algorithms to find the best seeding and extension combinations. We developed an open source gene… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
26
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 27 publications
(27 citation statements)
references
References 17 publications
0
26
0
1
Order By: Relevance
“…Subfigure ii) shows the situation of i) for 2-step 3-mers, a subset of 3-mers that contains every second 3-mer merely. The first and last nucleotides of s are not covered by 2-step 3-mers and must be discovered by an additional extension For a reference R and query Q, the set of all MEMs is defined, in accordance with [6], as follows: For all pairs of reference-query positions (x, y), with R[x] = Q[y], 0 ≤ x < |R|, 0 ≤ y <|Q|, we get a MEM (q, r, l) as follows: Let i be the minimal value such that R…”
Section: Definitions and Notationsmentioning
confidence: 99%
See 2 more Smart Citations
“…Subfigure ii) shows the situation of i) for 2-step 3-mers, a subset of 3-mers that contains every second 3-mer merely. The first and last nucleotides of s are not covered by 2-step 3-mers and must be discovered by an additional extension For a reference R and query Q, the set of all MEMs is defined, in accordance with [6], as follows: For all pairs of reference-query positions (x, y), with R[x] = Q[y], 0 ≤ x < |R|, 0 ≤ y <|Q|, we get a MEM (q, r, l) as follows: Let i be the minimal value such that R…”
Section: Definitions and Notationsmentioning
confidence: 99%
“…Most high-throughput read aligners [1][2][3][4][5] perform the following three steps: seeding [6,7], seed processing (e.g. chaining) [8,9] and dynamic programming [10,11].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Via the algorithmic bridge proposed here (Alg. 1), it is possible to define a mapping between -mers and MEMs (maximal exact matches, see [6]). The mapping is surjective, because every MEM has at least one corresponding -mer.…”
mentioning
confidence: 99%
“…Computation of SMEMs out of MEMs.Algorithm 2a shows the Pseudocode for extraction of SMEMs of MEMs using a sorting followed by a single sweep over all MEMs. In order to prove the correctness of the algorithm, we characterize SMEMs (in accordance to[6,15]) as follows: A SMEM is a MEM that is not enclosed by another MEM on the query. The above algorithm identifies all enclosed MEMs by iterating over them ordered by their start positions on the query.…”
mentioning
confidence: 99%