Novosphingobium nitrogenifigens was originally isolated from pulp and paper mill wastewater, a low-nitrogen, high-carbon environment. N. nitrogenifigens is the first known nitrogen-fixing, polyhydroxyalkanoate-accumulating sphingomonad, and we report the annotated draft genome sequence of the type strain Y88 T here.
Sphingomonad bacteria are often found in high-carbon, lownitrogen, environmentally degraded habitats and/or are capable of metabolizing various industrial pollutants as carbon sources (1,5,6,9). Several of these species have also been found to produce polyhydroxyalkanoate (PHA) (1, 4, 6), which represents a class of polymers with properties comparable to those of petroleum-derived plastics (2). The potential of diazotrophic bacteria that can use waste-derived feedstocks to help lower the cost of PHA production is an active area of research (8). Sphingomonads can potentially serve as PHA production strains in nitrogen-poor waste stream substrates, based on their abilities to thrive in such environments.Novosphingobium nitrogenifigens Y88 T is a Gram-negative, obligate aerobe sphingomonad that was originally isolated from nickel-enriched pulp and paper mill wastewater (1). N. nitrogenifigens is capable of nitrogen fixation and production of poly-3-hydroxybutyrate (PHB). N. nitrogenifigens has also proven effective in the removal of Mn and Zn from model biorefinery effluent (A. Palumbo et al., submitted for publication). These metabolic properties led us to further explore the characteristics of this organism aided by its genome sequence.The draft genome of N. nitrogenifigens Y88 T (ICMP 16470 T , DSM 19370 T ) was obtained via Roche 454 pyrosequencing (454 Life Sciences, Branford CT) combining 454 GS 20 and 454 GS FLX Titanium paired-end sequencing (1,151,296 quality reads). Using Newbler 2.3 (454 Life Sciences), these reads were assembled into 19 scaffolds comprising 77 contigs at 60-fold coverage, totaling 4.14 Mbp. GϩC content was 63%. Annotation of predicted coding sequences (CDS) was performed using Glimmer version 3.02 (3), which identified 3,801 putative protein-coding genes. Of these CDS, 1,056 were identified as hypothetical/unknown function predicted proteins. Additionally, 49 tRNA genes were identified using the stand-alone version of tRNAscan-SE, v1.23 (7).Two small, related plasmids named py8 (5,911 bp) and py157 (5,720 bp) were also identified and independently confirmed (Quanfeng Liang and Gareth Lloyd-Jones, unpublished results). In contrast to the genome sequence, 363-fold (py8) and 465-fold (py157) coverage of the plasmids was achieved. The GϩC content of both plasmids was 58%. Neither of the plasmids harbored any CDS similar to known genes other than rep, mob, and traA genes, involved in plasmid conjugation and replication functions. Interestingly, these genes and a number of hypothetical CDS possess the greatest similarity to CDS in Sphingobium japonicum UT26S plasmid pUT2 (GenBank accession no. AP010807), a molecule with a molecular weight similar to those of py8 and py157.As expec...