2023
DOI: 10.1128/spectrum.03328-22
|View full text |Cite
|
Sign up to set email alerts
|

Improved Assembly of Metagenome-Assembled Genomes and Viruses in Tibetan Saline Lake Sediment by HiFi Metagenomic Sequencing

Abstract: To expand the understanding of microbial dark matter in the environment, we did the first comparative evaluation of multiple assembly strategies based on high-throughput short-read and HiFi data from lake sediments metagenomic sequencing. The results demonstrated great improvement of the ‘Hybrid’ assembly method (short-read next-generation sequencing data plus HiFi data) in the recovery of medium/high-quality MAGs and viral genomes.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

2
11
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 15 publications
(13 citation statements)
references
References 80 publications
2
11
0
Order By: Relevance
“…These differences likely reflect the inherent genomic characteristics of the groups enriched within each sequencing technology [ 46 ]. Biases in low and high GC content spaces are recognized for SR technologies [ 47 ], while reports on LR approaches have noted anywhere from minimal GC biases in mock communities [ 8 ] to a higher recovery of high GC content sequences in metagenomes [ 46 , 48 , 49 ]. According to our results, most SR-only species belonged to Bacteroidia , Alphaproteobacteria , and Gammaproteobacteria , whereas for LR-only species, Acidiimicrobiia , and Verrucomicrobiae were the two major classes.…”
Section: Resultsmentioning
confidence: 99%
“…These differences likely reflect the inherent genomic characteristics of the groups enriched within each sequencing technology [ 46 ]. Biases in low and high GC content spaces are recognized for SR technologies [ 47 ], while reports on LR approaches have noted anywhere from minimal GC biases in mock communities [ 8 ] to a higher recovery of high GC content sequences in metagenomes [ 46 , 48 , 49 ]. According to our results, most SR-only species belonged to Bacteroidia , Alphaproteobacteria , and Gammaproteobacteria , whereas for LR-only species, Acidiimicrobiia , and Verrucomicrobiae were the two major classes.…”
Section: Resultsmentioning
confidence: 99%
“…Hybrid assembly approaches, leveraging complementary beneficial attributes of both LR and SR platforms to overcome their limitations, are already being used to study microbial communities in various evniromenments [1][2][3][4][5][6][7][8][9][10][11][12][13][14][15] . Assembly of LRs offers a major advantage over SR data alone, due to the ability to achieve greater contiguity, however, it comes at a significant cost due to lower accuracy 1,5,8,11,14,17,[21][22][23][24]32 .…”
Section: Correction and Polishing Affect Gene-and Genome-centric Comm...mentioning
confidence: 99%
“…Though the increasing variety of high-throughput short-and long-read (meta)genomic sequencing technologies are only within their first decades of existence, both the sequencing technologies and software development have flourished and has already been implemented to study microbial ecosystems [1][2][3][4][5][6][7][8][9][10][11][12][13][14][15] . Integrating both short-and long-read platforms for single microorganisms or microbial communities is gaining popularity because they compensate for the…”
Section: Background/introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…While originally limited by high sequence error rates, novel strategies such as the PacBio HiFi sequencing enable the recovery of sequence qualities comparable to short reads (9). PacBio HiFi has recently been applied to various sample types, including human and sheep faecal samples (10, 11), chicken intestinal samples (12), anaerobic digesters (13), seawater (14), and saline lake sediments (15). However, the costs per data unit for long-read technologies are still several orders of magnitude higher than short-read technologies, which has hampered their widespread adoption.…”
Section: Introductionmentioning
confidence: 99%