2022
DOI: 10.1038/s41597-022-01762-z
|View full text |Cite
|
Sign up to set email alerts
|

Benchmarking second and third-generation sequencing platforms for microbial metagenomics

Abstract: Shotgun metagenomic sequencing is a common approach for studying the taxonomic diversity and metabolic potential of complex microbial communities. Current methods primarily use second generation short read sequencing, yet advances in third generation long read technologies provide opportunities to overcome some of the limitations of short read sequencing. Here, we compared seven platforms, encompassing second generation sequencers (Illumina HiSeq 300, MGI DNBSEQ-G400 and DNBSEQ-T7, ThermoFisher Ion GeneStudio … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
43
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 64 publications
(45 citation statements)
references
References 58 publications
2
43
0
Order By: Relevance
“…The Illumina SR approach recovered an average of 36 Gbp per sample (~ 240 bp paired-end reads), while the CCS PacBio LR approach recovered an average of 12 Gbp per sample (~ 6 kbp CCS reads) (Table S 1 ). This difference translated into SR metagenomic samples having, on average, ~3 times more sequenced base pairs than LR samples, which is within the expected output for single sequenced samples [ 8 ]. Additionally, LR metagenomic samples had slightly higher GC content (0.43) compared to their SR counterpart (0.38) when all reads were considered (Table S 1 ).…”
Section: Resultsmentioning
confidence: 96%
See 1 more Smart Citation
“…The Illumina SR approach recovered an average of 36 Gbp per sample (~ 240 bp paired-end reads), while the CCS PacBio LR approach recovered an average of 12 Gbp per sample (~ 6 kbp CCS reads) (Table S 1 ). This difference translated into SR metagenomic samples having, on average, ~3 times more sequenced base pairs than LR samples, which is within the expected output for single sequenced samples [ 8 ]. Additionally, LR metagenomic samples had slightly higher GC content (0.43) compared to their SR counterpart (0.38) when all reads were considered (Table S 1 ).…”
Section: Resultsmentioning
confidence: 96%
“…These differences likely reflect the inherent genomic characteristics of the groups enriched within each sequencing technology [ 46 ]. Biases in low and high GC content spaces are recognized for SR technologies [ 47 ], while reports on LR approaches have noted anywhere from minimal GC biases in mock communities [ 8 ] to a higher recovery of high GC content sequences in metagenomes [ 46 , 48 , 49 ]. According to our results, most SR-only species belonged to Bacteroidia , Alphaproteobacteria , and Gammaproteobacteria , whereas for LR-only species, Acidiimicrobiia , and Verrucomicrobiae were the two major classes.…”
Section: Resultsmentioning
confidence: 99%
“…Strains that are well distinguishable on the basis of 16S rRNA genes could be well identifiable at the species level, indicating a high accuracy of the chosen method. Although the Ion Torrent ® platform might produce higher rates of indel errors, especially in homopolymer regions [ 32 ], comparable performance for 16S amplicon sequencing to Illumina-based workflows has been demonstrated [ 33 ]. However, quality filtering steps and subsequent bioinformatic methods should be adapted to the technology used and validated.…”
Section: Discussionmentioning
confidence: 99%
“…For each sample, 1 μg of high-molecular-weight DNA (>10 kbp) was used to build the library. DNA was sheared into 150-bp fragments using an ultrasonicator (Covaris, Woburn, MA), and DNA fragment library construction was performed using the Ion Plus fragment library and Ion Xpress barcode adapters kit (Thermo Fisher Scientific, USA), as previously described ( 57 ). We used an Ion Proton sequencer and Ion GeneStudio S5 prime sequencer to sequence the libraries (Thermo Fisher Scientific, USA), with a minimum of 20 million 150-bp high-quality reads generated per library for luminal, mucosal, and enriched saliva samples.…”
Section: Methodsmentioning
confidence: 99%