2020
DOI: 10.1101/2020.03.17.994566
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Estimating Assembly Base Errors Using K-mer Abundance Difference (KAD) Between Short Reads and Genome Assembled Sequences

Abstract: Genome sequences provide genomic maps with a single-base resolution for exploring genetic contents. Sequencing technologies, particularly long reads, have revolutionized genome assemblies for producing highly continuous genome sequences. However, current long-read sequencing technologies generate inaccurate reads that contain many errors. Some errors are retained in assembled sequences, which are typically not completely corrected by using either long reads or more accurate short reads. The issue commonly exis… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 41 publications
0
1
0
Order By: Relevance
“…The base accuracy of the A188Ref1 assembly was estimated at approximately 99.82% using the KAD pipeline 17 . Approximately 96.4% of the potential errors are in transposons or other repetitive sequences.…”
Section: Chromosome-level A188 Assemblymentioning
confidence: 99%
“…The base accuracy of the A188Ref1 assembly was estimated at approximately 99.82% using the KAD pipeline 17 . Approximately 96.4% of the potential errors are in transposons or other repetitive sequences.…”
Section: Chromosome-level A188 Assemblymentioning
confidence: 99%