The primary antigen receptor repertoire is sculpted by the process of V(D)J recombination, which must strike a balance between diversification and favoring gene segments with specialized functions. The precise determinants of how often gene segments are chosen to complete variable region coding exons remain elusive. We quantified Vβ use in the preselection Tcrb repertoire and report relative contributions of 13 distinct features that may shape their recombination efficiencies, including transcription, chromatin environment, spatial proximity to their DβJβ targets, and predicted quality of recombination signal sequences (RSSs). We show that, in contrast to functional Vβ gene segments, all pseudo-Vβ segments are sequestered in transcriptionally silent chromatin, which effectively suppresses wasteful recombination. Importantly, computational analyses provide a unifying model, revealing a minimum set of five parameters that are predictive of Vβ use, dominated by chromatin modifications associated with transcription, but largely independent of precise spatial proximity to DβJβ clusters. This learned model-building strategy may be useful in predicting the relative contributions of epigenetic, spatial, and RSS features in shaping preselection V repertoires at other antigen receptor loci. Ultimately, such models may also predict how designed or naturally occurring alterations of these loci perturb the preselection use of variable gene segments.lymphocytes | T-cell receptor | gene regulation G ene activity is regulated at multiple levels to coordinate expression during development. At a most basic level, the collection of cis-acting elements for a genetic locus recruits transcription factors that alter its chromatin environment to either induce or repress gene activity. Emerging studies indicate that the 3D conformation of a locus also plays an important role in the regulation of its composite genes (1). At most genes, many levels of control are integrated to achieve the requisite gene expression state. For example, transcriptional promoters interact with their cognate enhancers over considerable distances in the linear genome to generate "hubs" where the two cis elements are in spatial proximity (1, 2).All of these regulatory strategies are used to generate functional Ig (Ig) and T-cell receptor (Tcr) genes during lymphocyte development (3). Each antigen receptor (AgR) locus is composed of multiple variable (V), joining (J), and sometimes diversity (D) gene segments that are assembled by the process of V (D)J recombination, creating a potential variable region exon (4). Recombination is mediated by the RAG-1/2 enzymatic complex, which is expressed in all developing lymphocytes and recognizes semiconserved recombination signal sequences (RSSs) flanking all AgR gene segments (5). On selection of two compatible gene segments by RAG-1/2, recombination proceeds via a DNA break/repair mechanism, ultimately fusing the two selected segments (4, 5).The assembly of AgR genes is strictly regulated despite a common collection of ge...