Admixture between genetically different populations may produce gametic association between gene loci as a function of the genetic difference between parental populations and the admixture rate. This association decays as a function of time since admixture and the recombination rate between the loci. Admixture between genetically long-separated human populations has been frequent in the centuries since the age of exploration and colonization, resulting in numerous hybrid descendant populations today, as in the Americas. This represents a natural experiment for genetic epidemiology and anthropology, in which to use polymorphic marker loci (e.g., restriction fragment length polymorphisms) and disequilibrium to infer a genetic basis for traits of interest. In this paper we show that substantial disequilibrium remains today under widely applicable situations, which can be detected without requiring inordinately dose linkage between trait and marker loci. Very disparate parental allele frequencies produce large disequilibrium, but the sample size needed to detect such levels of disequilibrium can be large due to the skewed haplotype frequency distribution in the admixed population. Such situations, however, provide power to differentiate between disequilibrium due just to population mixing from that due to physical linkage of loci-i.e., to help map the genetic locus of the trait. A gradient of admixture levels between the same parental populations may be used to test genetic models by relating admixture to disequilibrium levels. Admixture between two populations with different allele frequencies at two loci will produce a gametic association between these loci in any admixed population (1). Here, we refer to such gametic association as "mixture disequilibrium" to distinguish it from gametic association between closely linked loci. Such mixture disequilibrium will decay over time, but Earlier, we (3) showed the utility of using admixed populations for fitting genetic models of inheritance of complex diseases. The objective of the present paper is to show that the observed levels of disequilibrium between any two loci in such an array of admixed populations may be used to detect their linkage relationship and to differentiate the case of mixture disequilibrium between loci from the disequilibrium that can be ascribed to genetic linkage.MATHEMATICAL TREATMENT Mixture Disequilibrium in an Admixed Population. As in the case of traditional admixture models, we consider two loci (A and B) that are not affected by selection. Let A and a, B and b be the two segregating alleles at these loci, respectively. Suppose that an admixed population (Z) obtains a fraction (m) of its genes from ancestral population X, and a fraction (1 -m) from ancestral population Y. We assume that the admixture event has taken place in a single pulse at generation 0, and the populations are surveyed t generations after this event. This theory is discussed in more detail elsewhere (ref. 2 and unpublished work). Let r denote the recombination ...
Lipoprotein lipase plays a central role in lipid metabolism and the gene that encodes this enzyme (LPL) is a candidate susceptibility gene for cardiovascular disease. Here we report the complete sequence of a fraction of the LPL gene for 71 individuals (142 chromosomes) from three populations that may have different histories affecting the organization of the sequence variation. Eighty-eight sites in this 9.7 kb vary among individuals from these three populations. Of these, 79 were single nucleotide substitutions and 9 sites involved insertion-deletion variations. The average nucleotide diversity across the region was 0.2% (or on average 1 variable site every 500 bp). At 34 of these sites, the variation was found in only one of the populations, reflecting the differing population and mutational histories. If LPL is a typical human gene, the pattern of sequence variation that exists in introns as well as exons, even for the small number of samples considered here, will present challenges for the identification of sites, or combinations of sites, that influence variation in risk of disease in the population at large.
Three common protein isoforms of apolipoprotein E (apoE), encoded by the epsilon2, epsilon3, and epsilon4 alleles of the APOE gene, differ in their association with cardiovascular and Alzheimer's disease risk. To gain a better understanding of the genetic variation underlying this important polymorphism, we identified sequence haplotype variation in 5.5 kb of genomic DNA encompassing the whole of the APOE locus and adjoining flanking regions in 96 individuals from four populations: blacks from Jackson, MS (n=48 chromosomes), Mayans from Campeche, Mexico (n=48), Finns from North Karelia, Finland (n=48), and non-Hispanic whites from Rochester, MN (n=48). In the region sequenced, 23 sites varied (21 single nucleotide polymorphisms, or SNPs, 1 diallelic indel, and 1 multiallelic indel). The 22 diallelic sites defined 31 distinct haplotypes in the sample. The estimate of nucleotide diversity (site-specific heterozygosity) for the locus was 0.0005+/-0.0003. Sequence analysis of the chimpanzee APOE gene showed that it was most closely related to human epsilon4-type haplotypes, differing from the human consensus sequence at 67 synonymous (54 substitutions and 13 indels) and 9 nonsynonymous fixed positions. The evolutionary history of allelic divergence within humans was inferred from the pattern of haplotype relationships. This analysis suggests that haplotypes defining the epsilon3 and epsilon2 alleles are derived from the ancestral epsilon4s and that the epsilon3 group of haplotypes have increased in frequency, relative to epsilon4s, in the past 200,000 years. Substantial heterogeneity exists within all three classes of sequence haplotypes, and there are important interpopulation differences in the sequence variation underlying the protein isoforms that may be relevant to interpreting conflicting reports of phenotypic associations with variation in the common protein isoforms.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.