2014
DOI: 10.1186/1471-2105-15-264

Enhancing the detection of barcoded reads in high throughput DNA sequencing data by controlling the false discovery rate

Abstract: Background: DNA barcodes are short unique sequences used to label DNA or RNA-derived samples in multiplexed deep sequencing experiments. During the demultiplexing step, barcodes must be detected and their position identified. In some cases (e.g., with PacBio SMRT), the position of the barcode and DNA context is not well defined. Many reads start inside the genomic insert so that adjacent primers might be missed. The matter is further complicated by coincidental similarities between barcode sequences and referenc…

Cited by 13 publications (12 citation statements)
References 31 publications
“…It may be argued that LDPC barcodes are two or three times larger than commercial random barcodes currently in use and systematic barcoding designs based on Hamming or Golay codes, which at most require a handy number of bases. In agreement with [ 57 ], our results suggest that there could be a high price to be paid for using such small barcoding systems: either high rates of critical undetected multiplexation errors or high rates of read losses must be tolerated.…”
Section: Discussion (supporting)
confidence: 90%
“…Recent research shows that even for sequencing approaches the detection of barcodes is quite challenging. In [ 24 ] they focus on a specific problem with certain setups on the PacBio SMRT platform. For technical reasons, barcodes are sporadically not present in the sequence data.…”
Section: Results (mentioning)
confidence: 99%
“…For codes based on sequence edit distance, this extended problem is addressed for, e.g., the PacBio SMRT platform in [ 24 ]. We conjecture that the detection problem of barcodes based on watermarks can be solved in future investigations.…”
Section: Methods (mentioning)
confidence: 99%
“…It is therefore important to adopt proper quality control measures that include the use of control samples. Sequencing laboratories sporadically report carry-over contamination between successive sequencing experiments (49), or crosstalk between sequencing libraries in the case of multiplexed runs (50).…”
Section: Carry-over (mentioning)
confidence: 99%