Arbuscular mycorrhizal fungi (AMF) are plant root symbionts that play key roles in plant growth and soil fertility. They are obligate biotrophic fungi that form coenocytic multinucleated hyphae and spores. Numerous studies have shown that diverse microorganisms live on the surface of and inside their mycelia, resulting in a metagenome when whole-genome sequencing (WGS) data are obtained from sequencing AMF cultivated in vivo. The metagenome contains not only the AMF sequences, but also those from associated microorganisms. In this study, we introduce a novel bioinformatics program, Spore-associated Symbiotic Microbes ( SeSaMe ), designed for taxonomic classification of short sequences obtained by next-generation DNA sequencing. A genus-specific usage bias database was created based on amino acid usage and codon usage of a three consecutive codon DNA 9-mer encoding an amino acid trimer in a protein secondary structure. The program distinguishes between coding sequence (CDS) and non-CDS, and classifies a query sequence into a genus group out of 54 genera used as reference. The mean percentages of correct predictions of the CDS and the non-CDS test sets at the genus level were 71% and 50% for bacteria, 68% and 73% for fungi (excluding AMF), and 49% and 72% for AMF ( Rhizophagus irregularis ), respectively. SeSaMe provides not only a means for estimating taxonomic diversity and abundance but also the gene reservoir of the reference taxonomic groups associated with AMF. Therefore, it enables users to study the symbiotic roles of associated microorganisms. It can also be applicable to other microorganisms as well as soil metagenomes. SeSaMe is freely available at www.fungalsesame.org .
Arbuscular mycorrhizal fungi (AMF) are plant root symbionts that play key roles in plant growth and soil fertility. They are obligate biotrophic fungi that form coenocytic multinucleated hyphae and spores.Numerous studies have shown that diverse microorganisms live on the surface and inside their mycelia, resulting in a metagenome when whole genome sequencing (WGS) data are obtained from sequencing AMF cultivated in vivo. The metagenome contains not only the AMF sequences, but also those from associated microorganisms. In this article, we introduce a novel bioinformatics program, SeSaMe, designed for taxonomic classification of short sequences obtained by next-generation DNA sequencing.A genus-specific usage bias database was created based on amino acid usage and codon usage of three consecutive codon DNA 9-mers encoding for an amino acid trimer in a protein secondary structure.The program distinguishes between coding sequence (CDS) and non-CDS, and classifies a query sequence into a genus group out of 54 genera used as reference. The average correct prediction percentages of the CDS and the non-CDS test sets at the genus level were 71% and 50% for bacteria, 65% and 73% for fungi (excluding AMF), and 49% and 72% for AMF (Rhizophagus irregularis), respectively. The program provides a means for estimating not only taxonomic diversity and abundance but also the gene reservoir of the reference taxonomic groups associated with AMF. Therefore, the program enables users to study the symbiotic roles of associated microorganisms. SeSaMe can be applicable to other microorganisms as well as soil metagenomes. It is freely available at www.journal.com and www.fungalsesame.org.
Although a large number of databases are available for regulatory elements, a bottleneck has been created by the lack of bioinformatics tools to predict the interaction modes of regulatory elements. To reduce this gap, we developed the Arabidopsis Transcription Regulatory Factor Domain/Domain Interaction Analysis Tool–liquid/liquid phase separation (LLPS), oligomerization, GO analysis (ART FOUNDATION-LOG), a useful toolkit for protein–nucleic acid interaction (PNI) and protein–protein interaction (PPI) analysis based on domain–domain interactions (DDIs). LLPS, protein oligomerization, the structural properties of protein domains, and protein modifications are major components in the orchestration of the spatiotemporal dynamics of PPIs and PNIs. Our goal is to integrate PPI/PNI information into the development of a prediction model for identifying important genetic variants in peaches. Our program unified interdatabase relational keys based on protein domains to facilitate inference from the model species. A key advantage of this program lies in the integrated information of related features, such as protein oligomerization, LOG analysis, structural characterizations of domains (e.g., domain linkers, intrinsically disordered regions, DDIs, domain–motif (peptide) interactions, beta sheets, and transmembrane helices), and post-translational modification. We provided simple tests to demonstrate how to use this program, which can be applied to other eukaryotic organisms.
Although there are a large number of databases available for regulatory elements, bottleneck has been created by the lack of bioinformatics tools for predicting types of mechanisms underlying actions of regulatory elements. To reduce the gap, we developed ARabidopsis Transcription regulatory Factor Domain-domain interaction Analysis Tool- Liquid-liquid phase separation (LLPS), Oligomerization, GO analysis (ART FounDATion-LOG), a useful toolkit for protein-nucleic acid interactions (PNI) and protein-protein interactions (PPI) analysis based on domain-domain interaction (DDI). LLPS, protein oligomerization, structural properties of protein domains, and protein modifications are major components in orchestrating spatio-temporal dynamics of PPI and PNI. Our goal is to integrate PPI/PNI information into development of prediction model for identifying important genetic variants in peach. The program unified inter-database relational keys by protein domains for facilitating inference from the model species. Key advantage of the program lies in the integrated information of related features: LOG, structural characterization of domain (e.g. domain linker, intrinsically disordered regions, DDI, domain-motif (peptide) interaction, beta-sheet and transmembrane helices), and post-translational modification. We provided simple tests to demonstrate how to use the program. The program may be applied to other eukaryotic organisms. The program codes and data are freely available for download at and https://sourceforge.net/projects/artfoundation-log/.
In this article, we introduce a novel bioinformatics program-SeSaMe PS Function (Spore associated Symbiotic Microbes Position Specific Function) -for position-specific functional analysis of short sequences derived from metagenome sequencing data of the arbuscular mycorrhizal fungi. The unique advantage of the program lies in databases created based on genus-specific sequence properties derived from protein secondary structure, namely amino acid usages, codon usages, and codon contexts of three codon DNA 9-mers. SeSaMe PS Function searches a query sequence against reference sequence database, identifies three codon DNA 9-mers with structural roles, and dynamically creates the comparative dataset of 54 microbial genera based on their codon usage biases. The program applies correlation Principal Component Analysis in conjunction with K-means clustering method to the comparative dataset. Three codon DNA 9-mers clustered as sole member or with only a few members are often structurally and functionally distinctive sites that provide useful insights into important molecular interactions. The program provides a versatile means for studying functions of short sequences from metagenome sequencing and has a wide spectrum of applications.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.