All cellular proteins are derived from preexisting ones by natural selection. Because of the random nature of this process, many potentially useful protein structures never arose or were discarded during evolution. Here, we used a single round of genetic selection in mouse cells to isolate chemically simple, biologically active transmembrane proteins that do not contain any amino acid sequences from preexisting proteins. We screened a retroviral library expressing hundreds of thousands of proteins consisting of hydrophobic amino acids in random order to isolate four 29-aa proteins that induced focus formation in mouse and human fibroblasts and tumors in mice. These proteins share no amino acid sequences with known cellular or viral proteins, and the simplest of them contains only seven different amino acids. They transformed cells by forming a stable complex with the platelet-derived growth factor β receptor transmembrane domain and causing ligand-independent receptor activation. We term this approach de novo selection and suggest that it can be used to generate structures and activities not observed in nature, create prototypes for novel research reagents and therapeutics, and provide insight into cell biology, transmembrane protein-protein interactions, and possibly virus evolution and the origin of life.synthetic biology | protein engineering | receptor tyrosine kinase | E5 protein | traptamer A ll extant naturally occurring proteins are derived from preexisting ones by the incremental process of natural selection. Cellular proteins are thus the products of a long chain of evolutionary decisions driven by chance mutations and selective pressures, which allowed some profitable lineages to flourish but drove many others to extinction. Indeed, because of the immense diversity that can result from the arrangement of 20 different amino acids into long polymers, the roster of distinct proteins on earth is a miniscule fraction of possible chemical structures (1). Because they are trapped by their evolutionary history, naturally occurring proteins are not necessarily the optimal proteins for accomplishing a given task. Therefore, one of the major goals of synthetic biology is to generate artificial proteins for a variety of practical applications, such as optimizing catalytic activity and even developing entirely new catalytic activities, generating molecules suitable for industrial processes, and fabricating novel biomaterials.Computational methods have been developed to construct artificial proteins based on detailed understanding of the chemistry, energetics, and structure-function relationships of existing proteins (2-4). This process, known as de novo or computational protein design, typically starts from a known protein structure or a conserved motif and iteratively samples numerous parameters such as amino acid substitutions and side-chain rotamers to identify predicted low-energy states. Although this approach requires enormous computational power, it has been used successfully for both globular and transmemb...