Atomistic molecular dynamics (MD) simulations of protein molecules are too computationally expensive to predict most native structures from amino acid sequences. Here, we integrate "weak" external knowledge into folding simulations to predict protein structures, given their sequence. For example, we instruct the computer "to form a hydrophobic core," "to form good secondary structures," or "to seek a compact state." This kind of information has been too combinatoric, nonspecific, and vague to help guide MD simulations before. Within atomistic replica-exchange molecular dynamics (REMD), we develop a statistical mechanical framework, modeling using limited data with coarse physical insight(s) (MELD + CPI), for harnessing weak information. As a test, we apply MELD + CPI to predict the native structures of 20 small proteins. MELD + CPI samples to within less than 3.2 Å from native for all 20 and correctly chooses the native structures (<4 Å) for 15 of them, including ubiquitin, a millisecond folder. MELD + CPI is up to five orders of magnitude faster than brute-force MD, satisfies detailed balance, and should scale well to larger proteins. MELD + CPI may be useful where physics-based simulations are needed to study protein mechanisms and populations and where we have some heuristic or coarse physical knowledge about states of interest.protein folding | molecular dynamics | integrative structural biology | Bayesian inference C omputer modeling is an important source of insights into the properties of protein molecules. There are two main approaches, each with different main areas of applicability: comparative modeling and atomistic molecular dynamics (MD) simulations. Comparative modeling draws inferences from a database of the more than 100,000 known native structures of proteins (1); it is an information-centric approach. A key area of applicability is in predicting the native structures of previously unknown proteins. These methods are often tested in the community-wide blind event for predicting native protein structures, called community assessment of structure prediction (2, 3). In contrast, physics-based atomistic simulations are aimed at computing proper relative populations of the many different states of a system; this type of modeling is an energy-centric approach. Computing proper populations (or, correspondingly, free energies) is essential for elucidating stabilities, motions, and mechanistic actions of protein molecules.Physical simulations offer important advantages in the long run, providing a principled and transferrable basis for understanding properties; the capability to go beyond just native structures alone to dynamics, binding, folding, and mechanisms; applicability where databases are limited, including membrane proteins or other foldable polymers, such as peptoids (4); and extensibility to other temperatures, solvents, and binding conditions, for example. A proper physical model requires a plausible physical energy function that can accurately predict native structures (validation); that app...