Until now, it has been reasonably assumed that specific base-pair recognition is the only mechanism controlling the specificity of transcription factor (TF)âDNA binding. Contrary to this assumption, here we show that nonspecific DNA sequences possessing certain repeat symmetries, when present outside of specific TF binding sites (TFBSs), statistically control TFâDNA binding preferences. We used highthroughput proteinâDNA binding assays to measure the binding levels and free energies of binding for several human TFs to tens of thousands of short DNA sequences with varying repeat symmetries. Based on statistical mechanics modeling, we identify a new proteinâDNA binding mechanism induced by DNA sequence symmetry in the absence of specific base-pair recognition, and experimentally demonstrate that this mechanism indeed governs proteinâDNA binding preferences. proteinâDNA binding is an important biophysical mechanism operating in a living cell (1). This seminal work makes it possible to interpret experiments that measured how transcription factors (TFs) search for their specific target sites flanked by nonconsensus sequence elements (1-10). A specific consensus motif is a short DNA sequence, typically 6-20 base pairs (bp), that possesses an enhanced binding affinity for a particular TF. For example, the sequence CACGTG represents the specific consensus motif for the human protein Max used in this study (Fig. 1). The process of establishing specific, consensus proteinâDNA binding requires the formation of precise geometrical fit between the protein and its consensus DNA motif, accompanied by the formation of specific hydrogen and electrostatic contacts at the proteinâDNA binding interface (6, 7) ( Fig. 1). In addition to binding to their consensus DNA motifs, transcription factors can also bind, albeit with lower affinity, to DNA regions lacking any consensus motifs. The term "nonspecific proteinâDNA binding" (6) is typically used to describe these weaker interactions. Von Hippel and Berg suggested classifying nonspecific proteinâDNA binding into two related mechanisms (6). The first mechanism includes protein binding to its mutated specific motifs that retain some residual, reduced specificity. The second mechanism is largely DNA sequence independent, and it involves electrostatic binding modulated by the overall DNA geometry (6). Despite significant experimental progress, molecular mechanisms responsible for these two types of nonspecific binding remain poorly understood, and the free energy of nonspecific proteinâDNA binding has not been systematically characterized (11)(12)(13)(14). The interplay between consensus and nonconsensus DNA sequence elements emerges as a dominant factor that governs proteinâDNA binding preferences. However, this interplay is also poorly understood (15, 16). Until now, it has been reasonably assumed that specific (consensus) base-pair recognition must control the genome-wide specificity of TFâDNA binding.Contrary to this assumption, here we identify a general mechanism for proteinâDNA bi...