Protein tertiary structures are known to be encoded in amino acid sequences, but the problem of structure prediction from sequence continues to be a challenge. With this question in mind, recent simulations have shown that atomic burials, as expressed by atom distances to the molecular geometrical center, are sufficiently informative for determining native conformations of small globular proteins. Here we use a simple computational experiment to estimate the amount of this required burial information and find it to be surprisingly small, actually comparable with the stringent limit imposed by sequence statistics. Atomic burials appear to satisfy, therefore, minimal requirements for a putative dominating property in the folding code because they provide an amount of information sufficiently large for structural determination but, at the same time, sufficiently small to be encodable in sequences. In a simple analogy with human communication, atomic burials could correspond to the actual "language" encoded in the amino acid "script" from which the complexity of native conformations is recovered during the folding process.protein folding | structure prediction | information theory | folding code D uring the last two decades, a physical picture of the folding process has emerged with the advent of energy landscape theory (1-5) but, despite many recent advances, a general solution to the problem of structure prediction from sequence has remained elusive. Most attempts in this direction have assumed sequences to encode partial information about many structural properties, such as likelihood of tertiary contacts or secondary structure propensities, that could eventually be combined to provide a general predictive algorithm (6-10). An alternative scheme would assume a single (or few) conformational property to be directly encoded in sequences, resulting in a small number of sequence-dependent parameters, whereas other conformational features would arise from sequence-independent constraints. The importance of such constraints has been recently emphasized by Banavar and collaborators (11).The amount of information provided by a putative single property dominating the code should satisfy two conditions: It should be sufficiently large for structural determination but sufficiently small for being encodable in sequences (12). The widely recognized importance of hydrophobic interactions on protein structure formation (13,14) suggests atomic burials to constitute a natural candidate for this putative dominant property. There has been some discussion, in the simplified context of lattice models, on the possibility that intrinsically unspecific hydrophobicity could satisfy the first condition (15), including a dependence on the choice of native conformation (16,17). Encouraging results from recent Monte Carlo simulations, on the other hand, indicate that the first condition is satisfied by atomic burials, as measured by distances from the molecular geometrical center, for small globular proteins represented by off-lattice, geome...