FASTA and FASTQ are basic and ubiquitous formats for storing nucleotide and protein sequences. Common manipulations of FASTA/Q file include converting, searching, filtering, deduplication, splitting, shuffling, and sampling. Existing tools only implement some of these manipulations, and not particularly efficiently, and some are only available for certain operating systems. Furthermore, the complicated installation process of required packages and running environments can render these programs less user friendly. This paper describes a cross-platform ultrafast comprehensive toolkit for FASTA/Q processing. SeqKit provides executable binary files for all major operating systems, including Windows, Linux, and Mac OSX, and can be directly used without any dependencies or pre-configurations. SeqKit demonstrates competitive performance in execution time and memory usage compared to similar tools. The efficiency and usability of SeqKit enable researchers to rapidly accomplish common FASTA/Q file manipulations. SeqKit is open source and available on Github at https://github.com/shenwei356/seqkit.
Background
Streptococcus suis serotype 2 (S. suis 2, SS2) has evolved into a highly infectious entity, which caused the two recent large-scale outbreaks of human SS2 epidemic in China, and is characterized by a toxic shock-like syndrome. However, the molecular pathogenesis of this new emerging pathogen is still poorly understood.Methodology/Principal Findings89K is a newly predicted pathogenicity island (PAI) which is specific to Chinese epidemic strains isolated from these two SS2 outbreaks. Further bioinformatics analysis revealed a unique two-component signal transduction system (TCSTS) located in the candidate 89K PAI, which is orthologous to the SalK/SalR regulatory system of Streptococcus salivarius. Knockout of salKR eliminated the lethality of SS2 in experimental infection of piglets. Functional complementation of salKR into the isogenic mutant ΔsalKR restored its soaring pathogenicity. Colonization experiments showed that the ΔsalKR mutant could not colonize any susceptible tissue of piglets when administered alone. Bactericidal assays demonstrated that resistance of the mutant to polymorphonuclear leukocyte (PMN)-mediated killing was greatly decreased. Expression microarray analysis exhibited a transcription profile alteration of 26 various genes down-regulated in the ΔsalKR mutant.Conclusions/SignificanceThese findings suggest that SalK/SalR is requisite for the full virulence of ethnic Chinese isolates of highly pathogenic SS2, thus providing experimental evidence for the validity of this bioinformatically predicted PAI.
Pathogenicity islands (PAIs), a distinct type of genomic island (GI), play important roles in the rapid adaptation and increased virulence of pathogens. 89K is a newly identified PAI in epidemic Streptococcus suis isolates that are related to the two recent large-scale outbreaks of human infection in China. However, its mechanism of evolution and contribution to the epidemic spread of S. suis 2 remain unknown. In this study, the potential for mobilization of 89K was evaluated, and its putative transfer mechanism was investigated. We report that 89K can spontaneously excise to form an extrachromosomal circular product. The precise excision is mediated by an 89K-borne integrase through site-specific recombination, with help from an excisionase. The 89K excision intermediate acts as a substrate for lateral transfer to non-89K S. suis 2 recipients, where it reintegrates site-specifically into the target site. The conjugal transfer of 89K occurred via a GI type IV secretion system (T4SS) encoded in 89K, at a frequency of 10−6 transconjugants per donor. This is the first demonstration of horizontal transfer of a Gram-positive PAI mediated by a GI-type T4SS. We propose that these genetic events are important in the emergence, pathogenesis and persistence of epidemic S. suis 2 strains.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.