Emergence of genomic diversity and recurrent mutations in SARS-CoV-2

Dorp, Lucy van; Acman, Mislav; Richard, Damien; Shaw, Liam; Ford, Charlotte; Ormond, Louise; Owen, Christopher J.; Pang, Juanita; Tan, Cedric; Boshier, Piers R.; Ortiz, Arturo Torres; Balloux, François

doi:10.1016/j.meegid.2020.104351

Cited by 806 publications

(924 citation statements)

References 40 publications

Supporting

Mentioning

854

Contrasting

Unclassified

Order By: Relevance

“…One possible approach is to mask out specific sites in the genome sequence where recurring errors are suspected, as suggested previously [24] . However, genuine recurrent mutations that may contain important information about properties of viral evolution [6,8,[25][26][27] are sometimes hard to distinguish from recurrent systematic errors, and this could obscure important biology. Here, we present data that we hope will help the community make the important decision as to how to treat potential errors in SARS-CoV-2 genome sequences.…”

Section: Figure 1: Effect Of Recurrent Sequencing Mutations On Phylogmentioning

confidence: 99%

“…Extremely rapid whole genome sequencing has enabled nearly real-time tracing of the evolution of the SARS-CoV-2 pandemic [1][2][3][4][5] . By leveraging sequence data produced by labs throughout the world, researchers can trace transmission of the virus across human populations [6][7][8][9][10][11][12][13][14] . Typically, viral evolution is encapsulated by a phylogenetic tree relating all of the virus samples in a large set to one another [5,[15][16][17][18][19] .…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Stability of SARS-CoV-2 Phylogenies

Turakhia

Thornlow

Gozashti

et al. 2020

Preprint

View full text Add to dashboard Cite

The SARS-CoV-2 pandemic has led to unprecedented, nearly real-time genetic tracing due to the rapid community sequencing response. Researchers immediately leveraged these data to infer the evolutionary relationships among viral samples and to study key biological questions, including whether host viral genome editing and recombination are features of SARS-CoV-2 evolution. This global sequencing effort is inherently decentralized and must rely on data collected by many labs using a wide variety of molecular and bioinformatic techniques. There is thus a strong possibility that systematic errors associated with lab-specific practices affect some sequences in the repositories. We find that some recurrent mutations in reported SARS-CoV-2 genome sequences have been observed predominantly or exclusively by single labs, co-localize with commonly used primer binding sites and are more likely to affect the protein coding sequences than other similarly recurrent mutations. We show that their inclusion can affect phylogenetic inference on scales relevant to local lineage tracing, and make it appear as though there has been an excess of recurrent mutation and/or recombination among viral lineages. We suggest how samples can be screened and problematic mutations removed. We also develop tools for comparing and visualizing differences among phylogenies and we show that consistent clade-and tree-based comparisons can be made between phylogenies produced by different groups. These will facilitate evolutionary inferences and comparisons among phylogenies produced for a wide array of purposes. Building on the SARS-CoV-2 Genome Browser at UCSC, we present a toolkit to compare, analyze and combine SARS-CoV-2 phylogenies, find and remove potential sequencing errors and establish a widely shared, stable clade structure for a more accurate scientific inference and discourse.Foreword:

show abstract

Section: Figure 1: Effect Of Recurrent Sequencing Mutations On Phylogmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Stability of SARS-CoV-2 Phylogenies

Turakhia

Thornlow

Gozashti

et al. 2020

Preprint

View full text Add to dashboard Cite

show abstract

“…The closest relatives (RaTG13 and RmYN02, 96% and 93% nucleotide identity respectively) derive from the Intermediate Horseshoe bat (Rhinolophus affinis) and the Malayan Horseshoe bat (Rhinolophus malayanus) 2 , although the original host is yet to be conclusively identified 3 . Since spilling over to humans, the virus has diverged rapidly, but it is unclear whether these mutations have resulted in SARS-CoV-2 lineages with different epidemiological and evolutionary characteristics [4][5][6][7][8][9] . Several lineages have been highlighted for potential significance [4][5][6]9 .…”

Section: Introductionmentioning

confidence: 99%

“…Since spilling over to humans, the virus has diverged rapidly, but it is unclear whether these mutations have resulted in SARS-CoV-2 lineages with different epidemiological and evolutionary characteristics [4][5][6][7][8][9] . Several lineages have been highlighted for potential significance [4][5][6]9 . For consistency, we adopt the nomenclature outlined in 8 which classifies the initial lineages as A and B (previously labelled 'S' and 'L' 4 ).…”

Section: Introductionmentioning

confidence: 99%

“…For example, some lineages within a population can be rapidly expanding whereas others can be stationary 8 . Utilizing large numbers of sequences provided by GISAID 9 and recently developed phylodynamic tools, we interrogate SARS-CoV-2 population patterns to identify 'hidden' structure in the pandemic and investigate whether lineages are geographically partitioned and/or are on distinct demographic trajectories.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Emerging phylogenetic structure of the SARS-CoV-2 pandemic

Fountain-Jones

Appaw

Carver

et al. 2020

Preprint

View full text Add to dashboard Cite

Since spilling over into humans, SARS-CoV-2 has rapidly spread across the globe, accumulating significant genetic diversity. The structure of this genetic diversity, and whether it reveals epidemiological insights, are fundamental questions for understanding the evolutionary trajectory of this virus. Here we use a recently developed phylodynamic approach to uncover phylogenetic structures underlying the SARS-CoV-2 pandemic. We find support for three SARS-CoV-2 lineages co-circulating, each with significantly different demographic dynamics concordant with known epidemiological factors. For example, Lineage C emerged in Europe with a high growth rate in late February, just prior to the exponential increase in cases in several European countries. Mutations that characterize Lineage C in particular are non-synonymous and occur in functionally important gene regions responsible for viral replication and cell entry. Even though Lineages A and B had distinct demographic patterns, they were much more difficult to distinguish. Continuous application of phylogenetic approaches to track the evolutionary epidemiology of SARS-CoV-2 lineages will be increasingly important to validate the efficacy of control efforts and monitor significant evolutionary events in the future.

show abstract

Interface‐based design of the favipiravir‐binding site in SARS‐CoV‐2 RNA‐dependent RNA polymerase reveals mutations conferring resistance to chain termination

et al. 2021

View full text Add to dashboard Cite

Favipiravir is a broad-spectrum inhibitor of viral RNA-dependent RNA polymerase (RdRp) currently being used to manage COVID-19. Accumulation of mutations in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) RdRp may facilitate antigenic drift, generating favipiravir resistance. Focussing on the chain-termination mechanism utilized by favipiravir, we used highthroughput interface-based protein design to generate > 100 000 designs of the favipiravir-binding site of RdRp and identify mutational hotspots. We identified several single-point mutants and designs having a sequence identity of 97%-98% with wild-type RdRp, suggesting that SARS-CoV-2 can develop favipiravir resistance with few mutations. Out of 134 mutations documented in the CoV-GLUE database, 63 specific mutations were already predicted as resistant in our calculations, thus attaining~47% correlation with the sequencing data. These findings improve our understanding of the potential signatures of adaptation in SARS-CoV-2 against favipiravir.

show abstract

Emergence of genomic diversity and recurrent mutations in SARS-CoV-2

Cited by 806 publications

References 40 publications

Stability of SARS-CoV-2 Phylogenies

Stability of SARS-CoV-2 Phylogenies

Emerging phylogenetic structure of the SARS-CoV-2 pandemic

Interface‐based design of the favipiravir‐binding site in SARS‐CoV‐2 RNA‐dependent RNA polymerase reveals mutations conferring resistance to chain termination

Contact Info

Product

Resources

About