Large scale genomic analysis of 3067 SARS-1 CoV-2 genomes reveals a clonal geo-distribution 2 and a rich genetic variations of hotspots 3 mutations 4 Abstract 33In late December 2019, an emerging viral infection COVID-19 was identified in Wuhan, 34China, and became a global pandemic. Characterization of the genetic variants of SARS-35CoV-2 is crucial in following and evaluating it spread across countries. In this study, we 36 collected and analyzed 3,067 SARS-CoV-2 genomes isolated from 55 countries during the 37 first three months after the onset of this virus. Using comparative genomics analysis, we 38 traced the profiles of the whole-genome mutations and compared the frequency of each 39 mutation in the studied population. The accumulation of mutations during the epidemic 40 period with their geographic locations was also monitored. The results showed 782 variant 41 sites, of which 512 (65.47%) had a non-synonymous effect. Frequencies of mutated alleles 42 revealed the presence of 38 recurrent non-synonymous mutations, including ten hotspot 43 mutations with a prevalence higher than 0.10 in this population and distributed in six 44 SARS-CoV-2 genes. The distribution of these recurrent mutations on the world map 45 revealed certain genotypes specific to the geographic location. We also found co-occurring 46 mutations resulting in the presence of several haplotypes. Moreover, evolution over time 47We have also created an inclusive unified database (http://genoma.ma/covid-19/) that lists 52 all of the genetic variants of the SARS-CoV-2 genomes found in this study with 53 phylogeographic analysis around the world. 54 55 56
Investment in SARS-CoV-2 sequencing in Africa over the past year has led to a major increase in the number of sequences generated, now exceeding 100,000 genomes, used to track the pandemic on the continent. Our results show an increase in the number of African countries able to sequence domestically, and highlight that local sequencing enables faster turnaround time and more regular routine surveillance. Despite limitations of low testing proportions, findings from this genomic surveillance study underscore the heterogeneous nature of the pandemic and shed light on the distinct dispersal dynamics of Variants of Concern, particularly Alpha, Beta, Delta, and Omicron, on the continent. Sustained investment for diagnostics and genomic surveillance in Africa is needed as the virus continues to evolve, while the continent faces many emerging and re-emerging infectious disease threats. These investments are crucial for pandemic preparedness and response and will serve the health of the continent well into the 21st century.
In late December 2019, an emerging viral infection COVID-19 was identified in Wuhan, China, and became a global pandemic. Characterization of the genetic variants of SARS-CoV-2 is crucial in following and evaluating it spread across countries. In this study, we collected and analyzed 3,067 SARS-CoV-2 genomes isolated from 55 countries during the first three months after the onset of this virus. Using comparative genomics analysis, we traced the profiles of the whole-genome mutations and compared the frequency of each mutation in the studied population. The accumulation of mutations during the epidemic period with their geographic locations was also monitored. The results showed 782 variants sites, of which 512 (65.47%) had a non-synonymous effect. Frequencies of mutated alleles revealed the presence of 68 recurrent mutations, including ten hotspot non-synonymous mutations with a prevalence higher than 0.10 in this population and distributed in six SARS-CoV-2 genes. The distribution of these recurrent mutations on the world map revealed that certain genotypes are specific to geographic locations. We also identified co-occurring mutations resulting in the presence of several haplotypes. Moreover, evolution over time has shown a mechanism of mutation co-accumulation which might affect the severity and spread of the SARS-CoV-2. The phylogentic analysis identified two major Clades C1 and C2 harboring mutations L3606F and G614D, respectively and both emerging for the first time in China. On the other hand, analysis of the selective pressure revealed the presence of negatively selected residues that could be taken into considerations as therapeutic targets. We have also created an inclusive unified database (http://covid-19.medbiotech.ma) that lists all of the genetic variants of the SARS-CoV-2 genomes found in this study with phylogeographic analysis around the world.
The COVID-19 pandemic has been ongoing since its onset in late November 2019 in Wuhan, China. Understanding and monitoring the genetic evolution of the virus, its geographical characteristics, and its stability are particularly important for controlling the spread of the disease and especially for the development of a universal vaccine covering all circulating strains. From this perspective, we analyzed 30,983 complete SARS-CoV-2 genomes from 79 countries located in the six continents and collected from 24 December 2019, to 13 May 2020, according to the GISAID database. Our analysis revealed the presence of 3206 variant sites, with a uniform distribution of mutation types in different geographic areas. Remarkably, a low frequency of recurrent mutations has been observed; only 169 mutations (5.27%) had a prevalence greater than 1% of genomes. Nevertheless, fourteen non-synonymous hotspot mutations (>10%) have been identified at different locations along the viral genome; eight in ORF1ab polyprotein (in nsp2, nsp3, transmembrane domain, RdRp, helicase, exonuclease, and endoribonuclease), three in nucleocapsid protein, and one in each of three proteins: Spike, ORF3a, and ORF8. Moreover, 36 non-synonymous mutations were identified in the receptor-binding domain (RBD) of the spike protein with a low prevalence (<1%) across all genomes, of which only four could potentially enhance the binding of the SARS-CoV-2 spike protein to the human ACE2 receptor. These results along with intra-genomic divergence of SARS-CoV-2 could indicate that unlike the influenza virus or HIV viruses, SARS-CoV-2 has a low mutation rate which makes the development of an effective global vaccine very likely.
The Coronavirus disease 19 pandemic has been ongoing since its onset in late November 2019 in Wuhan, China. To date, the SARS-CoV-2 virus has infected more than 8 million people worldwide and killed over 5% of them. Efforts are being made all over the world to control the spread of the disease and most importantly to develop a vaccine. Understanding the genetic evolution of the virus, its geographic characteristics and stability is particularly important for developing a universal vaccine covering all circulating strains of SARS-CoV-2 and for predicting its efficacy. In this perspective, we analyzed the sequences of 30,983 complete genomes from 80 countries located in six geographical zones (Africa, Asia, Europe, North & South America, and Oceania) isolated from December 24, 2019 to May 13, 2020, and compared them to the reference genome.Our in-depth analysis revealed the presence of 3,206 variant sites compared to the reference Wuhan-Hu-1 genome, with a distribution that is largely uniform over all continents. Remarkably, a low frequency of recurrent mutations was observed; only 182 mutations (5.67%) had a prevalence greater than 1%. Nevertheless, fourteen hotspot mutations (> 10%) were identified at different locations, seven at the ORF1ab gene (in regions coding for nsp2, nsp3, nsp6, nsp12, nsp13, nsp14 and nsp15), three in the nucleocapsid protein, one in the spike protein, one in orf3a, and one in orf8. Moreover, 35 non-synonymous mutations were identified in the receptor-binding domain (RBD) of the spike protein with a low prevalence (<1%) across all genomes, of which only four could potentially enhance the binding of the SARS-CoV-2 spike protein to the human receptor ACE2.These results along with the phylogenetic analysis demonstrate that the virus does not have a significant divergence at the protein level compared to the reference both among and within different geographical areas. Unlike the influenza virus or HIV viruses, the slow rate of mutation of SARS-CoV-2 makes the potential of developing an effective global vaccine very likely.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.