Modified DNA bases in mammalian genomes, such as 5-methylcytosine ( 5m C) and its oxidized forms, are implicated in important epigenetic regulation processes. In human or mouse, successive enzymatic conversion of 5m C to its oxidized forms is carried out by the ten-eleven translocation (TET) proteins. Previously we reported the structure of a TET-like 5m C oxygenase (NgTET1) from Naegleria gruberi, a single-celled protist evolutionarily distant from vertebrates. Here we show that NgTET1 is a 5-methylpyrimidine oxygenase, with activity on both 5m C (major activity) and thymidine (T) (minor activity) in all DNA forms tested, and provide unprecedented evidence for the formation of 5-formyluridine ( 5f U) and 5-carboxyuridine ( 5ca U) in vitro. Mutagenesis studies reveal a delicate balance between choice of 5m C or T as the preferred substrate. Furthermore, our results suggest substrate preference by NgTET1 to 5m CpG and TpG dinucleotide sites in DNA. Intriguingly, NgTET1 displays higher T-oxidation activity in vitro than mammalian TET1, supporting a closer evolutionary relationship between NgTET1 and the base J-binding proteins from trypanosomes. Finally, we demonstrate that NgTET1 can be readily used as a tool in 5m C sequencing technologies such as single molecule, realtime sequencing to map 5m C in bacterial genomes at base resolution.odified DNA bases exist in all forms of life, from viruses to mammals with many different biological roles. Accordingly, diverse mechanisms have evolved to "write," "read," and "erase" these modifications. In mammals, 5-methylcytosine ( 5m C) is the major form of DNA modification and is implicated in many crucial developmental processes. In human and mouse, 5m C can be successively oxidized into 5-hydroxymethylcytosine ( 5hm C), 5-formylcytosine ( 5f C), and 5-carboxylcytosine ( 5ca C) by the teneleven translocation (TET) family of oxygenases (1-4). The bases of 5f C and 5ca C can be excised by thymine DNA glycosylase (4). The 5m C-oxidation-coupled base-excision repair pathway provides a plausible route for active demethylation in mammalian cells. Many other species, from simple to complex, maintain DNA methylation machinery throughout their life cycle that may contribute to epigenetic regulation. Therefore, an interesting perspective is to examine shared and distinct features of TET oxygenases in diverse eukaryotes (5, 6).The human and mouse genomes encode three paralogous TET proteins, TET1, TET2, and TET3, which presumably carry out both redundant and distinct functions (7,8). TET proteins belong to the diverse group of α-ketoglutarate (αKG) and Fe(II)-dependent oxygenases (5). Subgroup classification based on sequence similarity links the TET proteins to base J-binding proteins (JBP1 and JBP2), which are primarily present in trypanosomes and possess thymidine (T)-hydroxylation activity (1). Further bioinformatic analysis revealed eight paralogous TET/ JBP-like genes in the genome of Naegleria gruberi, a single-celled amoeboflagellate protist that is a distant cousin of the par...