Recent evidence demonstrates that novel protein-coding genes can arise de novo from intergenic loci. This evolutionary innovation is thought to be facilitated by the pervasive translation of intergenic transcripts, which exposes a reservoir of variable polypeptides to natural selection. Do intergenic translation events yield polypeptides with useful biochemical capacities?The answer to this question remains controversial. Here, we systematically characterized how de novo emerging coding sequences impact fitness. In budding yeast, overexpression of these sequences was enriched in beneficial effects, while their disruption was generally inconsequential.We found that beneficial emerging sequences have a strong tendency to encode putative transmembrane proteins, which appears to stem from a cryptic propensity for transmembrane signals throughout thymine-rich intergenic regions of the genome. These findings suggest that novel genes with useful biochemical capacities, such as transmembrane domains, tend to evolve de novo within intergenic loci that already harbored a blueprint for these capacities. 3 The molecular mechanisms and dynamics of de novo gene birth are poorly understood 1 . It is particularly unclear how non-genic sequences could spontaneously encode proteins with specific and useful biochemical capacities. To resolve this paradox, it has been proposed that pervasive translation of non-genic transcripts can expose genetic variation, in the form of novel polypeptides, to natural selection, thereby purging toxic sequences and providing adaptive potential to the 5 organism 2,3 . The genomic sequences encoding these novel polypeptides have been called "protogenes", to denote that they correspond to a distinct class of genetic elements that are intermediates between non-genic sequences and established genes 3 . In agreement, several studies reported that de novo emerging coding sequences tend to display lengths, transcript architectures, transcription levels, strength of purifying selection, sequence compositions, structural features and integration 10 in cellular networks that are intermediate between those observed in non-genic sequences and those observed in established genes 3-8 . Furthermore, pervasive translation of non-genic sequences has been observed repeatedly by ribosome profiling and proteo-genomics 3,[9][10][11][12] , and studies have shown that random sequence libraries harbor bioactive effects [13][14][15][16][17] . Nonetheless, whether and how native proto-genes carry adaptive potential remains unknown. 15 We sought to formalize the predictions of adaptive proto-gene evolution. We define adaptive potential as the capacity to increase fitness by means of evolutionary change. While any sequence may in theory carry adaptive potential, changes in established genes are typically constrained by preexisting selected effects -the specific physiological processes mediated by the gene products that lead to their evolutionary conservation 18 . In contrast, emerging proto-genes are 20 expected to mostly lack ...