Pan-genome is defined as the set of orthologous and unique genes of a specific group of organisms. The pan-genome is composed by the core genome, accessory genome, and species- or strain-specific genes. The pan-genome is considered open or closed based on the alpha value of the Heap law. In an open pan-genome, the number of gene families will continuously increase with the addition of new genomes to the analysis, while in a closed pan-genome, the number of gene families will not increase considerably. The first step of a pan-genome analysis is the homogenization of genome annotation. The same software should be used to annotate genomes, such as GeneMark or RAST. Subsequently, several software are used to calculate the pan-genome such as BPGA, GET_HOMOLOGUES, PGAP, among others. This review presents all these initial steps for those who want to perform a pan-genome analysis, explaining key concepts of the area. Furthermore, we present the pan-genomic analysis of 9 bacterial species. These are the species with the highest number of genomes deposited in GenBank. We also show the influence of the identity and coverage parameters on the prediction of orthologous and paralogous genes. Finally, we cite the perspectives of several research areas where pan-genome analysis can be used to answer important issues.
The mangrove oysters (Crassostrea gasar) are molluscs native to the Amazonia region and their exploration and farming has increased considerably in recent years. These animals are farmed on beds built in the rivers of the Amazonia estuaries and, therefore, the composition of their microbiome should be directly influenced by environmental conditions. Our work aimed to evaluate the changes in bacterial composition of oyster's microbiota at two different seasons (rainy and dry). For this purpose, we amplified and sequenced the V3-V4 regions of the 16S rRNA gene. Sequencing was performed on the Illumina MiSeq platform. According to the rarefaction curve, the sampling effort was sufficient to describe the bacterial diversity in the samples. Alpha-diversity indexes showed that the bacterial microbiota of oysters is richer during the rainy season. This richness is possibly associated with the diversity at lower taxonomic levels, since the relative abundance of bacterial phyla in the two seasons remained relatively constant. The main phyla found include Firmicutes, Bacteroidetes, Actinobacteria, and Proteobacteria. Similar results were found for the species Crassostrea gigas, Crassostrea sikamea, and Crassostrea corteziensis. Beta-diversity analysis showed that the bacterial composition of oyster's gut microbiota was quite different in the two seasons. Our data demonstrate the close relationship between the environment and the microbiome of these molluscs, reinforcing the need for conservation and sustainable management of estuaries in the Amazonia.
Bacteriocins are antimicrobial peptides expressed by bacteria through ribosomal activity. In this study, we analyzed the diversity of bacteriocin-like genes in the Tucuruí-HPP using a whole-metagenome shotgun sequencing approach. Three layers of the water column were analyzed (photic, aphotic and sediment). Detection of bacteriocin-like genes was performed with blastx using the BAGEL4 database as subject sequences. In order to calculate the abundance of bacteriocin-like genes we also determined the number of 16S rRNA genes using blastn. Taxonomic analysis was performed using RAST server and the metagenome was assembled using IDBA-UD in order to recover the full sequence of a zoocin which had its three-dimensional structure determined. The photic zone presented the highest number of reads affiliated to bacteriocins. The most abundant bacteriocins were sonorensin, Klebicin D , pyocin and colicin. The zoocin model was composed of eight anti-parallel β-sheets and two α-helices with a Zn 2+ ion in the active site. This model was considerably stable during 10 ns of molecular dynamics simulation. We observed a high diversity of bacteriocins in the Tucuruí-HPP, demonstrating that the environment is an inexhaustible source for prospecting these molecules. Finally, the zoocin model can be used for further studies of substrate binding and molecular mechanisms involving peptidoglycan degradation.
Background Bacteriocins are defined as thermolabile peptides produced by bacteria with biological activity against taxonomically related species. These antimicrobial peptides have a wide application including disease treatment, food conservation, and probiotics. However, even with a large industrial and biotechnological application potential, these peptides are still poorly studied and explored. BADASS is software with a user-friendly graphical interface applied to the search and analysis of bacteriocin diversity in whole-metagenome shotgun sequencing data. Results The search for bacteriocin sequences is performed with tools such as BLAST or DIAMOND using the BAGEL4 database as a reference. The putative bacteriocin sequences identified are used to determine the abundance and richness of the three classes of bacteriocins. Abundance is calculated by comparing the reads identified as bacteriocins to the reads identified as 16S rRNA gene using SILVA database as a reference. BADASS has a complete pipeline that starts with the quality assessment of the raw data. At the end of the analysis, BADASS generates several plots of richness and abundance automatically as well as tabular files containing information about the main bacteriocins detected. The user is able to change the main parameters of the analysis in the graphical interface. To demonstrate how the software works, we used four datasets from WMS studies using default parameters. Lantibiotics were the most abundant bacteriocins in the four datasets. This class of bacteriocin is commonly produced by Streptomyces sp. Conclusions With a user-friendly graphical interface and a complete pipeline, BADASS proved to be a powerful tool for prospecting bacteriocin sequences in Whole-Metagenome Shotgun Sequencing (WMS) data. This tool is publicly available at https://sourceforge.net/projects/badass/.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.