BackgroundVariable number of tandem repeats (VNTRs) that are widely distributed in the genome of Yersinia pestis proved to be useful markers for the genotyping and source-tracing of this notorious pathogen. In this study, we probed into the features of VNTRs in the Y. pestis genome and developed a simple hierarchical genotyping system based on optimized VNTR loci.Methodology/Principal FindingsCapillary electrophoresis was used in this study for multi-locus VNTR analysis (MLVA) in 956 Y. pestis strains. The general features and genetic diversities of 88 VNTR loci in Y. pestis were analyzed with BioNumerics, and a “14+12” loci-based hierarchical genotyping system, which is compatible with single nucleotide polymorphism-based phylogenic analysis, was established.Conclusions/SignificanceAppropriate selection of target loci reduces the impact of homoplasies caused by the rapid mutation rates of VNTR loci. The optimized “14+12” loci are highly discriminative in genotyping and source-tracing Y. pestis for molecular epidemiological or microbial forensic investigations with less time and lower cost. An MLVA genotyping datasets of representative strains will improve future research on the source-tracing and microevolution of Y. pestis.
We previously released the Anti-CRISPRdb database hosting anti-CRISPR proteins (Acrs) and associated information. Since then, the number of known Acr families, types, structures and inhibitory activities has accumulated over time, and Acr neighbors can be used as a candidate pool for screening Acrs in further studies. Therefore, we here updated the database to include the new available information. Our newly updated database shows several improvements: (i) it comprises more entries and families because it includes both Acrs reported in the most recent literatures and Acrs obtained via performing homologous alignment; (ii) the prediction of Acr neighbors is integrated into Anti-CRISPRdb v2.2, and users can identify novel Acrs from these candidates; and (iii) this version includes experimental information on the inhibitory strength and stage for Acr-Cas/Acr-CRISPR pairs, motivating the development of tools for predicting specific inhibitory abilities. Additionally, a parameter, the rank of codon usage bias (CUBRank), was proposed and provided in the new version, which showed a positive relationship with predicted result from AcRanker; hence, it can be used as an indicator for proteins to be Acrs. CUBRank can be used to estimate the possibility of genes occurring within genome island―a hotspot hosting potential genes encoding Acrs. Based on CUBRank and Anti-CRISPRdb, we also gave the first glimpse for the emergence of Acr genes (acrs).
Database URL
http://guolab.whu.edu.cn/anti-CRISPRdb
Anti-CRISPR proteins (Acrs) can suppress the activity of CRISPR-Cas systems. Some viruses depend on Acrs to expand their genetic materials into the host genome which can promote species diversity. Therefore, the identification and determination of Acrs are of vital importance. In this work we developed a random forest tree-based tool, AcrDetector, to identify Acrs in the whole genomescale using merely six features. AcrDetector can achieve a mean accuracy of 99.65%, a mean recall of 75.84%, a mean precision of 99.24% and a mean F1 score of 85.97%; in multi-round, 5-fold crossvalidation (30 different random states). To demonstrate that AcrDetector can identify real Acrs precisely at the whole genome-scale we performed a cross-species validation which resulted in 71.43% of real Acrs being ranked in the top 10. We applied AcrDetector to detect Acrs in the latest data. It can accurately identify 3 Acrs, which have previously been verified experimentally. A standalone version of AcrDetector is available at https://github.com/RiversDong/AcrDetector. Additionally, our result showed that most of the Acrs are transferred into their host genomes in a recent stage rather than early.
Song (2018) The complete chloroplast genome sequence of an Endangered orchid species Dendrobiumbellatulum (Orchidaceae), Mitochondrial DNA Part B, 3:1, 233-234,
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.