Large scale genomic analysis of 3067 SARS-1 CoV-2 genomes reveals a clonal geo-distribution 2 and a rich genetic variations of hotspots 3 mutations 4 Abstract 33In late December 2019, an emerging viral infection COVID-19 was identified in Wuhan, 34China, and became a global pandemic. Characterization of the genetic variants of SARS-35CoV-2 is crucial in following and evaluating it spread across countries. In this study, we 36 collected and analyzed 3,067 SARS-CoV-2 genomes isolated from 55 countries during the 37 first three months after the onset of this virus. Using comparative genomics analysis, we 38 traced the profiles of the whole-genome mutations and compared the frequency of each 39 mutation in the studied population. The accumulation of mutations during the epidemic 40 period with their geographic locations was also monitored. The results showed 782 variant 41 sites, of which 512 (65.47%) had a non-synonymous effect. Frequencies of mutated alleles 42 revealed the presence of 38 recurrent non-synonymous mutations, including ten hotspot 43 mutations with a prevalence higher than 0.10 in this population and distributed in six 44 SARS-CoV-2 genes. The distribution of these recurrent mutations on the world map 45 revealed certain genotypes specific to the geographic location. We also found co-occurring 46 mutations resulting in the presence of several haplotypes. Moreover, evolution over time 47We have also created an inclusive unified database (http://genoma.ma/covid-19/) that lists 52 all of the genetic variants of the SARS-CoV-2 genomes found in this study with 53 phylogeographic analysis around the world. 54 55 56
In late December 2019, an emerging viral infection COVID-19 was identified in Wuhan, China, and became a global pandemic. Characterization of the genetic variants of SARS-CoV-2 is crucial in following and evaluating it spread across countries. In this study, we collected and analyzed 3,067 SARS-CoV-2 genomes isolated from 55 countries during the first three months after the onset of this virus. Using comparative genomics analysis, we traced the profiles of the whole-genome mutations and compared the frequency of each mutation in the studied population. The accumulation of mutations during the epidemic period with their geographic locations was also monitored. The results showed 782 variants sites, of which 512 (65.47%) had a non-synonymous effect. Frequencies of mutated alleles revealed the presence of 68 recurrent mutations, including ten hotspot non-synonymous mutations with a prevalence higher than 0.10 in this population and distributed in six SARS-CoV-2 genes. The distribution of these recurrent mutations on the world map revealed that certain genotypes are specific to geographic locations. We also identified co-occurring mutations resulting in the presence of several haplotypes. Moreover, evolution over time has shown a mechanism of mutation co-accumulation which might affect the severity and spread of the SARS-CoV-2. The phylogentic analysis identified two major Clades C1 and C2 harboring mutations L3606F and G614D, respectively and both emerging for the first time in China. On the other hand, analysis of the selective pressure revealed the presence of negatively selected residues that could be taken into considerations as therapeutic targets. We have also created an inclusive unified database (http://covid-19.medbiotech.ma) that lists all of the genetic variants of the SARS-CoV-2 genomes found in this study with phylogeographic analysis around the world.
The COVID-19 pandemic has been ongoing since its onset in late November 2019 in Wuhan, China. Understanding and monitoring the genetic evolution of the virus, its geographical characteristics, and its stability are particularly important for controlling the spread of the disease and especially for the development of a universal vaccine covering all circulating strains. From this perspective, we analyzed 30,983 complete SARS-CoV-2 genomes from 79 countries located in the six continents and collected from 24 December 2019, to 13 May 2020, according to the GISAID database. Our analysis revealed the presence of 3206 variant sites, with a uniform distribution of mutation types in different geographic areas. Remarkably, a low frequency of recurrent mutations has been observed; only 169 mutations (5.27%) had a prevalence greater than 1% of genomes. Nevertheless, fourteen non-synonymous hotspot mutations (>10%) have been identified at different locations along the viral genome; eight in ORF1ab polyprotein (in nsp2, nsp3, transmembrane domain, RdRp, helicase, exonuclease, and endoribonuclease), three in nucleocapsid protein, and one in each of three proteins: Spike, ORF3a, and ORF8. Moreover, 36 non-synonymous mutations were identified in the receptor-binding domain (RBD) of the spike protein with a low prevalence (<1%) across all genomes, of which only four could potentially enhance the binding of the SARS-CoV-2 spike protein to the human ACE2 receptor. These results along with intra-genomic divergence of SARS-CoV-2 could indicate that unlike the influenza virus or HIV viruses, SARS-CoV-2 has a low mutation rate which makes the development of an effective global vaccine very likely.
The Coronavirus disease 19 pandemic has been ongoing since its onset in late November 2019 in Wuhan, China. To date, the SARS-CoV-2 virus has infected more than 8 million people worldwide and killed over 5% of them. Efforts are being made all over the world to control the spread of the disease and most importantly to develop a vaccine. Understanding the genetic evolution of the virus, its geographic characteristics and stability is particularly important for developing a universal vaccine covering all circulating strains of SARS-CoV-2 and for predicting its efficacy. In this perspective, we analyzed the sequences of 30,983 complete genomes from 80 countries located in six geographical zones (Africa, Asia, Europe, North & South America, and Oceania) isolated from December 24, 2019 to May 13, 2020, and compared them to the reference genome.Our in-depth analysis revealed the presence of 3,206 variant sites compared to the reference Wuhan-Hu-1 genome, with a distribution that is largely uniform over all continents. Remarkably, a low frequency of recurrent mutations was observed; only 182 mutations (5.67%) had a prevalence greater than 1%. Nevertheless, fourteen hotspot mutations (> 10%) were identified at different locations, seven at the ORF1ab gene (in regions coding for nsp2, nsp3, nsp6, nsp12, nsp13, nsp14 and nsp15), three in the nucleocapsid protein, one in the spike protein, one in orf3a, and one in orf8. Moreover, 35 non-synonymous mutations were identified in the receptor-binding domain (RBD) of the spike protein with a low prevalence (<1%) across all genomes, of which only four could potentially enhance the binding of the SARS-CoV-2 spike protein to the human receptor ACE2.These results along with the phylogenetic analysis demonstrate that the virus does not have a significant divergence at the protein level compared to the reference both among and within different geographical areas. Unlike the influenza virus or HIV viruses, the slow rate of mutation of SARS-CoV-2 makes the potential of developing an effective global vaccine very likely.
BackgroundCarbapenem-resistant Acinetobacter baumannii has recently been defined by the World Health Organization as a critical pathogen. The aim of this study was to compare clonal diversity and carbapenemase-encoding genes of A. baumannii isolates collected from colonized or infected patients and hospital environment in two intensive care units (ICUs) in Morocco.MethodsThe patient and environmental sampling was carried out in the medical and surgical ICUs of Mohammed V Military teaching hospital from March to August 2015. All A. baumannii isolates recovered from clinical and environmental samples, were identified using routine microbiological techniques and Matrix-Assisted Laser Desorption/Ionization Time-of-Flight Mass Spectrometry. Antimicrobial susceptibility testing was performed using disc diffusion method. The carbapenemase-encoding genes were screened for by PCR. Clonal relatedness was analyzed by digestion of the DNA with low frequency restriction enzymes and pulsed field gel electrophoresis (PFGE) and the multi locus sequence typing (MLST) was performed on two selected isolates from two major pulsotypes.ResultsA total of 83 multidrug-resistant A. baumannii isolates were collected: 47 clinical isolates and 36 environmental isolates. All isolates were positive for the bla OXA51-like and bla OXA23-like genes. The coexistence of bla NDM-1 /bla OXA-23-like and bla OXA 24-like /bla OXA-23-like were detected in 27 (32.5%) and 2 (2.4%) of A. baumannii isolates, respectively. The environmental samples and the fecally-colonized patients were significantly identified (p < 0.05) as the most common sites of isolation of NDM-1-harboring isolates. PFGE grouped all isolates into 9 distinct clusters with two major groups (0007 and 0008) containing up to 59% of the isolates. The pulsotype 0008 corresponds to sequence type (ST) 195 while pulsotype 0007 corresponds to ST 1089.The genetic similarity between the clinical and environmental isolates was observed in 80/83 = 96.4% of all isolates, belonging to 7 pulsotypes.ConclusionThis study shows that the clonal spread of environmental A. baumannii isolates is related to that of clinical isolates recovered from colonized or infected patients, being both associated with a high prevalence of the bla OXA23-like and bla NDM-1genes. These findings emphasize the need for prioritizing the bio-cleaning of the hospital environment to control and prevent the dissemination of A. baumannii clonal lineages.Electronic supplementary materialThe online version of this article (10.1186/s13756-017-0262-4) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.