Motivation
Long-read phasing has been used for reconstructing diploid genomes, improving variant calling, and resolving microbial strains in metagenomics. However, the phasing blocks of existing methods are broken by large Structural Variations (SVs), and the efficiency is unsatisfactory for population-scale phasing.
Results
This paper presents a novel algorithm, LongPhase, which can simultaneously phase single nucleotide polymorphisms (SNPs) and SVs of a human genome in 10-20 minutes, 10x faster than the state-of-the-art WhatsHap, HapCUT2 and Margin. In particular, co-phasing SNPs and SVs produces much larger haplotype blocks (N50 = 25Mbp) than those of existing methods (N50 = 10-15Mbp). We show that LongPhase combined with Nanopore ultra-long reads is a cost-effective and highly contiguous solution, which can produce between one and 26 blocks per chromosome arm without the need for additional trios, chromosome-conformation, and strand-seq data.
Availability
LongPhase is freely available at https://github.com/twolinin/LongPhase/.
Supplementary information
Supplementary figures and tables are available online.
Accurate diagnosis of chronic, non-healing wounds is challenging and time-consuming because it can be caused by a variety of etiologies. This brief report presents an unusual case of a chronic wound lasting for 10 months investigated by deep metagenomic sequencing. Epstein-Barr virus (EBV) was identified in the wound and subsequently validated by in situ hybridization. Histopathologic examination eventually revealed that the non-healing wound was due to an EBV-associated NK/T cell lymphoma. By identifying mutations across the viral genome, the virus was classified as Type I EBV and clustered with others of geographic proximity. Our results suggest that metagenomic shotgun sequencing can not only rapidly and accurately identify the presence of underlying pathogens but also provide strain-level resolution for the surveillance of viral epidemiology.
Long-read phasing has been used for reconstructing diploid genomes, improving variant calling, and resolving microbial strains in metagenomics. However, the phasing blocks of existing methods are broken by large Structural Variations (SVs), and the efficiency is unsatisfactory for population-scale phasing. This paper presents an ultra-fast algorithm, LongPhase, which can simultaneously phase single nucleotide polymorphisms (SNPs) and SVs of a human genome in ~10-20 minutes, 10x faster than the state-of-the-art WhatsHap and Margin. In particular, LongPhase produces much larger phased blocks at almost chromosome level with only long reads (N50=26Mbp). We demonstrate that LongPhase combined with Nanopore is a cost-effective approach for providing chromosome-scale phasing without the need for additional trios, chromosome-conformation, and single-cell strand-seq data.
Brain abscess is a severe infectious disease with high mortality and mobility. Although culture-based techniques have been widely used for the investigation of microbial composition of brain abscess, these approaches are inherent biased. Recent studies using 16S ribosomal sequencing approaches revealed high complexity of the bacterial community involved in brain abscess but fail to detect fungal and viral composition. In the study, both culture-independent nanopore metagenomic sequencing and culture-based whole-genome sequencing using both the Illumina and the Nanopore platforms were conducted to investigate the microbial composition and genomic characterization in brain abscess. Culture-independent metagenomic sequencing revealed not only a larger taxonomic diversity of bacteria but also the presence of fungi and virus communities. The culture-based whole-genome sequencing identified a novel species in Prevotella and reconstructs a Streptococcus constellatus with a high GC-skew genome. Antibiotic-resistance genes CfxA and ErmF associated with resistance to penicillin and clindamycin were also identified in culture-based and culture-free sequencing. This study implies current understanding of brain abscess need to consider the broader diversity of microorganisms.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.