2020
DOI: 10.1093/gigascience/giaa094
|View full text |Cite
|
Sign up to set email alerts
|

TGS-GapCloser: A fast and accurate gap closer for large genomes with low coverage of error-prone long reads

Abstract: Background Analyses that use genome assemblies are critically affected by the contiguity, completeness, and accuracy of those assemblies. In recent years single-molecule sequencing techniques generating long-read information have become available and enabled substantial improvement in contig length and genome completeness, especially for large genomes (>100 Mb), although bioinformatic tools for these applications are still limited. Find… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
174
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
2
1

Relationship

3
7

Authors

Journals

citations
Cited by 244 publications
(174 citation statements)
references
References 46 publications
0
174
0
Order By: Relevance
“…De novo assembly was performed by Supernova v2.1.1 (Supernova assembler, RRID:SCR_016756 ) [ 44 ] using the suggested parameters (--maxreads=2100000000 --accept-extreme-coverage --nopreflight). TGS-GapCloser v1.0.0 (TGS-GapCloser, RRID:SCR_017633 ) [ 45 , 46 ] was used to fill the gaps between contigs within same scaffolds, and this process was performed under the use of error-corrected ONT or PacBio data by Canu. The number of gaps within scaffolds was computed using the formula: number of contigs − number of scaffolds.…”
Section: Methodsmentioning
confidence: 99%
“…De novo assembly was performed by Supernova v2.1.1 (Supernova assembler, RRID:SCR_016756 ) [ 44 ] using the suggested parameters (--maxreads=2100000000 --accept-extreme-coverage --nopreflight). TGS-GapCloser v1.0.0 (TGS-GapCloser, RRID:SCR_017633 ) [ 45 , 46 ] was used to fill the gaps between contigs within same scaffolds, and this process was performed under the use of error-corrected ONT or PacBio data by Canu. The number of gaps within scaffolds was computed using the formula: number of contigs − number of scaffolds.…”
Section: Methodsmentioning
confidence: 99%
“…More detailed pipeline about scaffolding using Hi-C data is as described on protocol.io (dx.doi.org/10.17504/protocols.io.qradv2e, last accessed March 3, 2021). For the male genome, we used SUPERNOVA with 10X Genomics for initial assembly, then GAPCLOSER with short reads to fill gaps and TGS-GapCloser v1.0.0 ( Xu et al. 2020 ) with Nanopore reads to rescaffold using default parameters, and we finally used 3DDNA with Hi-C data to elevate the assembly to chromosome levels.…”
Section: Methodsmentioning
confidence: 99%
“…Long reads provide an effective and cost-efficient means to close gaps in fragmented assemblies, since these reads are often longer than genomic repeats, allowing one to bridge between neighboring contigs thus closing an assembly gap [7]. To close gaps in existing, more fragmented assemblies with long reads, several methods have been developed in the past, exemplified by PBJelly [8], FinisherSC [9], PacBio GenomicConsensus (PacBio GC; comprising the Arrow and Quiver methods; https://github.com/PacificBiosciences/GenomicConsensus), LR_gapcloser [10] and others [11][12][13]. These methods align a set of given long reads to an input assembly, determine which reads span assembly gaps and close these gaps with the new sequence.…”
Section: Introductionmentioning
confidence: 99%