2015
DOI: 10.1007/978-3-319-16706-0_29
|View full text |Cite
|
Sign up to set email alerts
|

Gap Filling as Exact Path Length Problem

Abstract: One of the last steps in a genome assembly project is filling the gaps between consecutive contigs in the scaffolds. This problem can be naturally stated as finding an s-t path in a directed graph whose sum of arc costs belongs to a given range (the estimate on the gap length). Here s and t are any two contigs flanking a gap. This problem is known to be NP-hard in general. Here we derive a simpler dynamic programming solution than already known, pseudo-polynomial in the maximum value of the input range. We imp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
3
1
1

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 20 publications
0
4
0
Order By: Relevance
“…With some optimizations however, the above algorithm can be accelerated. As observed by Salmela et al [39] in the context of gap-filling problem, we expect d 2 |V |, therefore, it should be possible to compute a sub-graph containing vertices within ≤ d 2 /2 distance from v 1 or v 2 , before solving the recurrence. While this strategy was shown to be effective for gap-filling between assembled contigs, the count of vertex pairs to evaluate during read mapping process is expected to be significantly higher for large read sets.…”
Section: A Pseudo-polynomial Time Algorithmmentioning
confidence: 76%
See 1 more Smart Citation
“…With some optimizations however, the above algorithm can be accelerated. As observed by Salmela et al [39] in the context of gap-filling problem, we expect d 2 |V |, therefore, it should be possible to compute a sub-graph containing vertices within ≤ d 2 /2 distance from v 1 or v 2 , before solving the recurrence. While this strategy was shown to be effective for gap-filling between assembled contigs, the count of vertex pairs to evaluate during read mapping process is expected to be significantly higher for large read sets.…”
Section: A Pseudo-polynomial Time Algorithmmentioning
confidence: 76%
“…The exact-path length problem determines if a path of a specified distance exists between two vertices in a weighted graph. An extension of this problem, referred to as the gap-filling problem [39], has been explored in the context of genome assembly using paired-end or mate pair read sets. Although the exact-path length problem has been shown to be N P-complete [33], we will demonstrate a simple and practical polynomial-time algorithm for our problem with unweighted edges.…”
Section: Related Problems In Graph Theorymentioning
confidence: 99%
“…The number of gaps could also be efficiently reduced by FGAP [22], which aligned long reads to the gaps using BLAST algorithm [23]. More tools modified the algorithm and extended for different purposes [24][25][26][27][28]. However, most tools mentioned above share the same crucial shortcoming: they only accept pre-error-corrected long reads or alternative assembled contigs.…”
Section: Problems In Current Tgs Assemblies and Tgs Gap-closing Toolsmentioning
confidence: 99%
“…about gene content) or perform comparative genomic analysis. When mate pairs are available, contigs can be fed to later assembly stages, such as scaffolding [34,2,20] and then gap filling [35,3].…”
Section: Introductionmentioning
confidence: 99%