2022
DOI: 10.1038/s41598-022-11656-0
|View full text |Cite
|
Sign up to set email alerts
|

Robust and scalable barcoding for massively parallel long-read sequencing

Abstract: Nucleic-acid barcoding is an enabling technique for many applications, but its use remains limited in emerging long-read sequencing technologies with intrinsically low raw accuracy. Here, we apply so-called NS-watermark barcodes, whose error correction capability was previously validated in silico, in a proof of concept where we synthesize 3840 NS-watermark barcodes and use them to asymmetrically tag and simultaneously sequence amplicons from two evolutionarily distant species (namely Bordetella pertussis and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 26 publications
0
4
0
Order By: Relevance
“…In these datasets, D1 D3 integrate the official nanopore barcode. In order to evaluate the robustness of the demultiplexing method and the performance of demultiplexing a large number of non-ONT barcodes [ 44 , 45 ], we increased the number of barcodes and generated simulated multi-sample sequencing data (D4 D7 and DB4 DB7), these barcodes were randomly generated, and a certain edit distance was guaranteed (edit distance = 14.5 ± 1.8). The barcode consist of three primary components: upstream flanking region, variable region, and downstream flanking region (Fig.…”
Section: Resultsmentioning
confidence: 99%
“…In these datasets, D1 D3 integrate the official nanopore barcode. In order to evaluate the robustness of the demultiplexing method and the performance of demultiplexing a large number of non-ONT barcodes [ 44 , 45 ], we increased the number of barcodes and generated simulated multi-sample sequencing data (D4 D7 and DB4 DB7), these barcodes were randomly generated, and a certain edit distance was guaranteed (edit distance = 14.5 ± 1.8). The barcode consist of three primary components: upstream flanking region, variable region, and downstream flanking region (Fig.…”
Section: Resultsmentioning
confidence: 99%
“…We did not use the theoretical relative abundances provided by ZymoBIOMICS as they might differ from the expected ones due to mishandling during library preparation (Supplementary Figure 1) [38]. Reads that mapped to the reference genomes of yeasts or remained unmapped were discarded to avoid overestimation in genome copies caused by eukaryotes [39], and potential contamination by e.g., barcode "crosstalk" [40].…”
Section: Mock Experimentsmentioning
confidence: 99%
“…In the most favorable case of independent random errors, three or more errors can occur frequently; burst errors such as stuttering or repeated deletions only make things worse. This paper explores a possible solution via the use of random barcodes ("randomers") [30], that is, barcode libraries of any desired size (≳ 10 6 , for example) whose codewords are approximately uniformly random, as generated by computer, with constraints of GC content, homopolymers, etc., easily imposed. Like "designed" barcodes, random barcodes would be synthesized in defined oligo pools, but with the difference that the number of pools could be as large as desired.…”
Section: Hawkins Et Al's "Free" Barcodesmentioning
confidence: 99%