Sequences of each of the four framework segments FRI, FR2, FR3, and FR4 of the variable regions (V-regions) of light and heavy chains of immunoglobulins were grouped into sets with identical sequences. Sets contained from 1 to 18 members. When each V-region was traced from one FR to the next, it was seen that members of the same set in FRI could be associated with different sets in FR2, FR3, and FR4. This suggests that the framework for the light and heavy chain V-regions is assembled during embryonic development from sets of minigenes for each FR segment. FR4 The variable region (V-region) of immunoglobulin light and heavy chains may be divided into four framework (FR) and three complementarity-determining [hypervariable (1)] regions or segments (CDR). In the V-region of light chains (VL), FR1, FR2, FR3, and FR4 comprise residues 1-23,35-49,57-88, and 98-107, respectively, and CDR1, CDR2, and CDR3 comprise residues 24-34, 50-56, and 89-97, respectively (1, 2). CDR1 and CDR3 may vary in length due to insertions (1-3, cf. 4). In the V-region of heavy chains (VH), the FR are 1-30, 36-49, 66-94, and 103-113 and CDR1, CDR2, and CDR3 are residues 31-35B, 50-65, and 95-102; CDR1, CDR2, CDR3, and FR3 may also have insertions (cf. 2). The light (5-7) and the heavy chains (8, 9) have been divided into subgroups based essentially on sequences in FR1. Each framework sequence for a subgroup is considered to represent a single structural gene. Recently, Potter (10) (11), for which the nucleic acid of the mouse VA gene was sequenced (12) from 12-day-old mouse embryo DNA, can be interpreted as having been assembled from FRI and FR3 from MOPC 315 (a VATI) and FR2 (except for Glu vs. Gln as discussed below) from MOPC 104E (a VAT); CDR1 had the same sequence in both VAT and VAIT whereas CDR2 and CDR3 had sequences of both VAT and VxIT. In line with recent findings in viral and mammalian genes, the present analysis predicts that intervening sequences will be found between the FR fragments and that the CDR will be recombined into the FR segments during early embryonic development with elimination of the intervening sequences to assemble a V-region gene. Intervening sequences (13) have now been found in a mouse f3-globin (14,15), rabbit globin (16), avian ovalbumin (17, 18), viral proteins (19-25), some ribosomal DNA genes in , and tRNA genes in yeast (30).
RESULTSThe data base (2) of V-region sequences supplemented by additional published data (31-46) contained complete and partial sequences of 509 light and 225 heavy chains. Sequences of human VJT, mouse VK, rabbit VK, and human and mouse VHIIT, the only groups with sufficient data, were sorted so that identical sequences of FRI, FR2, FR3, and FR4 were located; Glx and Asx were accepted as equivalent to Gln or Glu and Asn or Asp in assembling sets. These were grouped further and exAbbreviations: V-region, variable region; VL, VH, variable regions of light and heavy chains, respectively; V,>, variable regions of K light chains; VXA, VAII, variable regions of two subgro...