A consensus sequence for the human long interspersed repeated DNA element, L1Hs (LINE or KpnI sequence), is presented. The sequence contains two open reading frames (ORFs) which are homologous to ORFs in corresponding regions of L1 elements in other species. The L1Hs ORFs are separated by a small evolutionarily nonconserved region. The 5' end of the consensus contains frequent terminators in all three reading frames and has a relatively high GC content with numerous stretches of weak homology with AluI repeats. The 5' ORF extends for a minimum of 723 bp (241 codons). The 3' ORF is 3843 bp (1281 codons) and predicts a protein of 149 kD which has regions of weak homology to the polymerase domain of various reverse transcriptases. The 3' end of the consensus has a 208-bp nonconserved region followed by an adenine-rich end. The organization of the L1Hs consensus sequence resembles the structure of eukaryotic mRNAs except for the noncoding region between ORFs. However, due to base substitutions or truncation most elements appear incapable of producing mRNA that can be translated. Our observation that individual elements cluster into subfamilies on the basis of the presence or absence of blocks of sequence, or by the linkage of alternative bases at multiple positions, suggests that most L1 sequences were derived from a small number of structural genes. An estimate of the mammalian L1 substitution rate was derived and used to predict the age of individual human elements. From this it follows that the majority of human L1 sequences have been generated within the last 30 million years. The human elements studied here differ from each other, yet overall the L1Hs sequences demonstrate a pattern of species-specificity when compared to the L1 families of other mammals. Possible mechanisms that may account for the origin and evolution of the L1 family are discussed. These include pseudogene formation (retroposition), transposition, gene conversion, and RNA recombination.
A collaborative study involving a large sample of European Americans was typed for the histocompatibility loci of the HLA DR-DQ region and subjected to intensive typing validation measures in order to accurately determine haplotype composition and frequency. The resulting tables have immediate application to HLA typing and allogeneic transplantation. The loci within the DR-DQ region are especially valuable for such an undertaking because of their tight linkage and high linkage disequilibrium. The 3798 haplotypes, derived from 1899 unrelated individuals, had a total of 75 distinct DRB1-DQA1-DQB1 haplotypes. The frequency distribution of the haplotypes was right skewed with haplotypes occurring at a frequency of less than 1% numbering 59 and yet constituting less than 12% of the total sample. Given DRB1 typing, it was possible to infer the exact DQA1 and DQB1 composition of a haplotype with high confidence (>90% likelihood) in 21 of the 35 high-resolution DRB1 alleles present in the sample. Of the DRB1 alleles without high reliability for DQ haplotype inference, only *0401, *0701 and *1302 were common, the remaining 11 DRB1 alleles constituting less than 5% of the total sample. This approach failed for the 13 serologically equivalent DR alleles in which only 33% of DQ haplotypes could be reliably inferred. The 36 DQA1-DQB1 haplotypes present in the total sample conformed to the known pattern of permissible heterodimers. Four DQA1-DQB1 haplotypes, all rare, are reported here for the first time. The haplotype frequency tables are suitable as a reference standard for HLA typing of the DR and DQ loci in European Americans.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.