AbstractThe origin of eukaryotes is one of evolution’s most important transitions, yet it is still poorly understood. Evidence for how it occurred should be preserved in eukaryotic genomes. Based on phylogenetic trees from ribosomal RNA and ribosomal proteins, eukaryotes are typically depicted as branching together with or within archaea. This ribosomal affiliation is widely interpreted as evidence for an archaeal origin of eukaryotes. However, the extent to which the archaeal ancestry of genes for the cytosolic ribosomes of eukaryotic cells is representative for the rest of the eukaryotic genome is unknown. Here we have clustered 19,050,992 protein sequences from 5,443 bacteria and 212 archaea with 3,420,731 protein sequences from 150 eukaryotes spanning six eukaryotic supergroups to identify genes that link eukaryotes exclusively to bacteria and archaea respectively. By downsampling the bacterial sample we obtain estimates for the bacterial and archaeal proportions of genes among 150 eukaryotic genomes. Eukaryotic genomes possess a bacterial majority of genes. On average, eukaryotic genes are 56% bacterial in origin. The majority drops to 53% in eukaryotes that never possessed plastids, and increases to 61% in photosynthetic eukaryotic lineages, where the cyanobacterial ancestor of plastids contributed additional genes to the eukaryotic genome, reaching 67% in higher plants. Intracellular parasites, which undergo reductive evolution in adaptation to the nutrient rich environment of the cells that they infect, relinquish bacterial genes for metabolic processes. In the current sample, this process of adaptive gene loss is most pronounced in the human parasite Encephalitozoon intestinalis with 86% archaeal and 14% bacterial derived genes. The most bacterial eukaryote genome sampled is rice, with 67% bacterial and 33% archaeal genes. The functional dichotomy, initially described for yeast, of archaeal genes being involved in genetic information processing and bacterial genes being involved in metabolic processes is conserved across all eukaryotic supergroups.