From a consideration of (qO, k6) values of the amino acids of myoglobin, lysozyme, the a and ft chains of horse oxyhemoglobin, tosyl-a-chymotrypsin, and carboxypeptidase A, an empirical procedure of predicting whether amino-acid residues in proteins are in a non-helical or may be in a helical conformation has been developed. The conformation of an amino acid at any position n is considered to be influenced by its nearest neighbors (the amino acids at positions n + 1 and n -1), and the (v,, (n -I)-(n)-(n + 1) taken sequentially for the entire chain was constructed; it lists the number of instances in which helical and non-helical conformations for the amino acids at position n were found. Certain sequences are found to be associated exclusively with non-helical and others exclusively with helical conformations, whereas many sequences may be either helical or non-helical. The distribution of non-helical residues serves to limit stretches of permissively helical regions; these are then further examined by the helical wheel method. As applied to cytochrome c from 18 species, the only permissively helical segment found was the stretch 91-101 near the Cterminus. For the variable regions of three light and three heavy chains of immunoglobulins, upper limits of 12 and 17% a-helix, respectively, were obtained.The prediction of secondary and tertiary structures of protein molecules from their primary sequence has been explored by Scheraga (1) and by Levinthal (2) for ribonuclease and cytochrome c respectively, following the suggestion by Anfinsen and Redfield (3) that the minimum energy state of a peptide chain should determine its folding pattern. Unfortunately, nonbonding interactions between atoms have not been determined with sufficient precision (4), and experimental findings on the structures of ribonuclease (5) and cytochrome c (6) have not substantiated the earlier theoretical predictions.However, several empirical procedures (7-12) have been developed to compare amino-acid sequences with identical or similar sequences in proteins of known three-dimensional structure. In a recent paper on the structure of cytochrome c, Dickerson et al. (13) remark on the value of such methods and compare the predictions with the structure.In this communication we are using the c and ,6 angles (14) of sequences in five known proteins to estimate the probability of any given residue being in a non-helical conformation or of permitting it to be in a helical conformation.
1501The method involves estimating the influence of the nearestneighbor amino acid on the conformation of each residue, location of helix-breaking sequences from a frequency table of helical and non-helical occurrences, and then examining the permissively helical sequences by the helical wheel analysis (10). The procedure, as applied to cytochrome c sequences from 18 species, identified the sole helical segment as 91-101, near the C-terminus. Numerous helix-breaking residues could be seen in ribonuclease, and several helices predicted by others could be e...