2017
DOI: 10.1101/216127
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Aligning sequences to general graphs in O(V + mE) time

Abstract: Graphs are commonly used to represent sets of sequences. Either edges or nodes can be labeled by sequences, so that each path in the graph spells a concatenated sequence. Examples include graphs to represent genome assemblies, such as string graphs and de Bruijn graphs, and graphs to represent a pan-genome and hence the genetic variation present in a population. Being able to align sequencing reads to such graphs is a key step for many analyses and its applications include genome assembly, read error correctio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
38
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 32 publications
(39 citation statements)
references
References 27 publications
(35 reference statements)
1
38
0
Order By: Relevance
“…We formalize a reference graph as a tuple G r = (V r , E r ) of nodes V r and directed, labeled edges E r ⊆ V r × V r × Σ, where the alphabet Σ = {A, C, G, T} represents the four different nucleotides. Note that in contrast to sequence graphs [28], we label edges instead of nodes.…”
Section: Task Description: Alignment To Reference Graphsmentioning
confidence: 99%
See 1 more Smart Citation
“…We formalize a reference graph as a tuple G r = (V r , E r ) of nodes V r and directed, labeled edges E r ⊆ V r × V r × Σ, where the alphabet Σ = {A, C, G, T} represents the four different nucleotides. Note that in contrast to sequence graphs [28], we label edges instead of nodes.…”
Section: Task Description: Alignment To Reference Graphsmentioning
confidence: 99%
“…. , |q|} (equivalent to [28]). Traversing a state v, i ∈ V q a represents the alignment of the first i query characters ending at node v. In particular, query position i = 0 indicates that we have not yet matched any letters from the query.…”
Section: Alignment On Editmentioning
confidence: 99%
“…More efficient algorithms in terms of worst case time complexity have been earlier suggested in [18,19] for an important case of edit distance or Levenshtein distance (µ and σ equal to 1). Supplement Section "Shortest paths search in binary-weighted graphs" presents a simple modification of the basic approach described above achieving the same time complexity of O(|G| · |Sub|).…”
Section: Sequence To Graph Alignment Via Alignment Graphsmentioning
confidence: 99%
“…Again, while we only consider linear gap penalties with coefficient σ (in our experiments we used σ = 5) here, affine gap penalties can also be implemented without significant increase of running time or memory footprint [19].…”
Section: Alignment Of Amino-acid Sequencesmentioning
confidence: 99%
“…A few recent works have investigated extending Burrows-Wheeler-Transform-based indexing to sequence DAGs [41] and de-Bruijn graphs [2,29,40]. Similarly, there exist studies that have explored extension of the classic sequence-to-sequence alignment routines to graphs [19,20,30,36]. In our recent work [19], we presented new complexity results and algorithms for the alignment problem using general sequence-labeled graphs.…”
Section: Introductionmentioning
confidence: 99%