The complete genome sequence of Clostridium difficile phage wC2 and comparisons to wCD119 and inducible prophages of CD630 The complete genomic sequence of a previously characterized temperate phage of Clostridium difficile, wC2, is reported. The genome is 56 538 bp and organized into 84 putative ORFs in six functional modules. The head and tail structural proteins showed similarities to that of C. difficile phage wCD119 and Streptococcus pneumoniae phage EJ-1, respectively. Homologues of structural and replication proteins were found in prophages 1 and 2 of the sequenced C. difficile CD630 genome. A putative holin appears unique to the C. difficile phages and was functional when expressed in Escherichia coli. Nucleotide sequence comparisons of wC2 to wCD119 and the CD630 prophage sequences showed relatedness between wC2 and the prophages, but less so to wCD119. wC2 integrated into a gene encoding a putative transcriptional regulator of the gntR family. wC2, wCD119 and CD630 prophage 1 genomes had a Cdu1-attP-integrase arrangement, suggesting that the pathogenicity locus (PaLoc) of C. difficile, flanked by cdu1, has phage origins. The attP sequences of wC2, wCD119 and CD630 prophages were dissimilar. wC2-related sequences were found in 84 % of 37 clinical C. difficile isolates and typed reference strains.
INTRODUCTIONClostridium difficile has emerged as an important intestinal pathogen since the 1970s, continuing to plague hospital settings worldwide and causing recent epidemics in the USA and Canada (Loo et al., 2005;Warny et al., 2005). The major virulence factors of C. difficile are toxins A and B encoded by tcdA and tcdB respectively, which are located on a 19 kb genomic region termed the pathogenicity locus (PaLoc) (Braun et al., 1996;Hammond & Johnson, 1995). The toxins are positively regulated by TcdR (Mani & Dupuy, 2001;Rupnik et al., 2005) and are negatively regulated by TcdC (Hundsberger et al., 1997;Matamouros et al., 2006); they are encoded by tcdR and tcdC respectively, also on the PaLoc. Another toxin-associated gene, tcdE, appears phage related but its function is unknown (Tan et al., 2001). C. difficile acquires antibiotic resistance and virulence genes through plasmids and transposons (Bruggemann, 2005) shown to significantly contribute to genome plasticity (Sebaihia et al., 2006). It is possible that phages also contribute to variance in virulence-associated genes (Lemee et al., 2005) and to the emergence of outbreak strains . However, the prevalence of phage genes within C. difficile genomes is not known. In comparison to phages of Escherichia coli, Staphylococcus aureus and Lactobacillus species, the study of clostridial phages is in its infancy. Only one phage specific for C. difficile, temperate phage wCD119, has been sequenced , while two putative prophage sequences were detected in the recently sequenced genome of C. difficile CD630 (Sebaihia et al., 2006).