Recent advances in protein design rely on rational and computational approaches to create novel sequences that fold and function. In contrast, natural systems selected functional proteins without any design a priori. In an attempt to mimic nature, we used large libraries of novel sequences and selected for functional proteins that rescue Escherichia coli cells in which a conditionally essential gene has been deleted. In this way, the de novo protein SynSerB3 was selected as a rescuer of cells in which serB, which encodes phosphoserine phosphatase, an enzyme essential for serine biosynthesis, was deleted. However, SynSerB3 does not rescue the deleted activity by catalyzing hydrolysis of phosphoserine. Instead, SynSerB3 upregulates hisB, a gene encoding histidinol phosphate phosphatase. This endogenous E. coli phosphatase has promiscuous activity that, when overexpressed, compensates for the deletion of phosphoserine phosphatase. Thus, the de novo protein SynSerB3 rescues the deletion of serB by altering the natural regulation of the His operon.O ne of the key goals of synthetic biology is to design novel proteins that fold and function in vivo. A particularly challenging objective would be to produce nonnatural proteins that do not merely generate interesting phenotypes but actually provide essential functions necessary for the growth of living cells. The successful design of such life-sustaining proteins would represent an initial step toward constructing artificial "proteomes" of nonnatural sequences.We initiated work toward artificial proteomes by constructing large combinatorial libraries of novel sequences designed to fold into stable four-helix structures (1, 2). Our libraries were based on a strategy for protein design, which assumes that the overall fold of a simple structure can be specified by the pattern of polar and nonpolar residues in the linear sequence. Because only the type of residue-polar versus nonpolar-is specified, this strategy has been called a "binary code" for protein design (1-3). At the same time, because the exact identities of the side chains at each polar and nonpolar position are not specified explicitly, this strategy is well suited for constructing large combinatorial libraries of novel sequences (3). To express these libraries of binary-patterned sequences in vivo, we construct collections of synthetic genes using degenerate DNA codons. For example, the degenerate codon NTN (N = A,T,C,G) is used to encode five nonpolar residues, and the degenerate codon VAN (V = A,C,G) is used to encode six polar residues.We have shown previously that several proteins from these binary patterned libraries fold into stable four-helix bundles, and both crystallographic and NMR structures have been determined (4-6). Moreover, in initial steps probing the potential for functional activity, proteins from these libraries were shown to bind small molecules, including cofactors, and to catalyze rudimentary reactions (7,8). Although those experiments screened a subset of library proteins for activity in ...