Salmonella enterica serovar Typhimurium (S. Typhimurium) is a zoonotic pathogen that causes diarrheal disease in humans and animals. During salmonellosis, S. Typhimurium colonizes epithelial cells lining the gastrointestinal tract. S. Typhimurium has an unusual lifestyle in epithelial cells that begins within an endocytic-derived Salmonella-containing vacuole (SCV), followed by escape into the cytosol, epithelial cell lysis and bacterial release. The cytosol is a more permissive environment than the SCV and supports rapid bacterial growth. The physicochemical conditions encountered by S. Typhimurium within the epithelial cytosol, and the bacterial genes required for cytosolic colonization, remain largely unknown. Here we have exploited the parallel colonization strategies of S. Typhimurium in epithelial cells to decipher the two niche-specific bacterial virulence programs. By combining a population-based RNA-seq approach with single-cell microscopic analysis, we identified bacterial genes with cytosol-induced or vacuole-induced expression signatures. Using these genes as environmental biosensors, we defined that Salmonella is exposed to oxidative stress and iron and manganese deprivation in the cytosol and zinc and magnesium deprivation in the SCV. Furthermore, iron availability was critical for optimal S. Typhimurium replication in the cytosol, as well as entC, fepB, soxS, mntH and sitA. Virulence genes that are typically associated with extracellular bacteria, namely Salmonella pathogenicity island 1 (SPI1) and SPI4, showed increased expression in the cytosol compared to vacuole. Our study reveals that the cytosolic and vacuolar S. Typhimurium virulence gene programs are unique to, and tailored for, residence within distinct intracellular compartments. This archetypical vacuole-adapted pathogen therefore requires extensive transcriptional reprogramming to successfully colonize the mammalian cytosol.
Background Accurate variant detection in the coding regions of the human genome is a key requirement for molecular diagnostics of Mendelian disorders. Efficiency of variant discovery from next-generation sequencing (NGS) data depends on multiple factors, including reproducible coverage biases of NGS methods and the performance of read alignment and variant calling software. Although variant caller benchmarks are published constantly, no previous publications have leveraged the full extent of available gold standard whole-genome (WGS) and whole-exome (WES) sequencing datasets. Results In this work, we systematically evaluated the performance of 4 popular short read aligners (Bowtie2, BWA, Isaac, and Novoalign) and 9 novel and well-established variant calling and filtering methods (Clair3, DeepVariant, Octopus, GATK, FreeBayes, and Strelka2) using a set of 14 “gold standard” WES and WGS datasets available from Genome In A Bottle (GIAB) consortium. Additionally, we have indirectly evaluated each pipeline’s performance using a set of 6 non-GIAB samples of African and Russian ethnicity. In our benchmark, Bowtie2 performed significantly worse than other aligners, suggesting it should not be used for medical variant calling. When other aligners were considered, the accuracy of variant discovery mostly depended on the variant caller and not the read aligner. Among the tested variant callers, DeepVariant consistently showed the best performance and the highest robustness. Other actively developed tools, such as Clair3, Octopus, and Strelka2, also performed well, although their efficiency had greater dependence on the quality and type of the input data. We have also compared the consistency of variant calls in GIAB and non-GIAB samples. With few important caveats, best-performing tools have shown little evidence of overfitting. Conclusions The results show surprisingly large differences in the performance of cutting-edge tools even in high confidence regions of the coding genome. This highlights the importance of regular benchmarking of quickly evolving tools and pipelines. We also discuss the need for a more diverse set of gold standard genomes that would include samples of African, Hispanic, or mixed ancestry. Additionally, there is also a need for better variant caller assessment in the repetitive regions of the coding genome.
We have developed an efficient and inexpensive pipeline for streamlining large-scale collection and genome sequencing of bacterial isolates. Evaluation of this method involved a worldwide research collaboration focused on the model organism Salmonella enterica, the 10KSG consortium. Following the optimization of a logistics pipeline that involved shipping isolates as thermolysates in ambient conditions, the project assembled a diverse collection of 10,419 isolates from low- and middle-income countries. The genomes were sequenced using the LITE pipeline for library construction, with a total reagent cost of less than USD$10 per genome. Our method can be applied to other large bacterial collections to underpin global collaborations.
The frequency of a genetic variant in a population is crucially important for accurate interpretation of known and novel variant effects in medical genetics. Recently, several large allele frequency databases, such as Genome Aggregation Database (gnomAD), have been created to serve as a global reference for such studies. However, frequencies of many rare alleles vary dramatically between populations, and population-specific allele frequency can be more informative than the global one. Many countries and regions (including Russia) remain poorly studied from the genetic perspective. Here, we report the first successful attempt to integrate genetic information between major medical genetic laboratories in Russia. We construct an expanded reference set of genetic variants by analyzing 6,096 exome samples collected in two major Russian cities of Moscow and St. Petersburg. An approximately tenfold increase in sample size compared to previous studies allowed us to identify genetically distinct clusters of individuals within an admixed population of Russia. We show that up to 18 known pathogenic variants are overrepresented in Russia compared to other European countries. We also identify several dozen high-impact variants that are present in healthy donors despite either being annotated as pathogenic in ClinVar or falling within genes associated with autosomal dominant disorders. The constructed database of genetic variant frequencies in Russia has been made available to the medical genetics community through a variant browser available at http://ruseq.ru.
Environmental perturbations impact multiple cellular traits, including gene expression. Bacteria respond to these stressful situations through complex gene interaction networks, thereby inducing stress tolerance and survival of cells. In this paper, we study the response mechanisms of E. coli when exposed to different environmental stressors via differential expression and co-expression analysis. Gene co-expression networks were generated and analyzed via Weighted Gene Co-expression Network Analysis (WGCNA). Based on the gene co-expression networks, genes with similar expression profiles were clustered into modules. The modules were analysed for identification of hub genes, enrichment of biological processes and transcription factors. In addition, we also studied the link between transcription factors and their differentially regulated targets to understand the regulatory mechanisms involved. These networks validate known gene interactions and provide new insights into genes mediating transcriptional regulation in specific stress environments, thus allowing for in silico hypothesis generation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.