Similarity between two individuals in the combination of genetic markers along their chromosomes indicates shared ancestry and can be used to identify historical connections between different population groups due to admixture. We use a genome-wide, haplotype-based, analysis to characterise the structure of genetic diversity and gene-flow in a collection of 48 sub-Saharan African groups. We show that coastal populations experienced an influx of Eurasian haplotypes over the last 7000 years, and that Eastern and Southern Niger-Congo speaking groups share ancestry with Central West Africans as a result of recent population expansions. In fact, most sub-Saharan populations share ancestry with groups from outside of their current geographic region as a result of gene-flow within the last 4000 years. Our in-depth analysis provides insight into haplotype sharing across different ethno-linguistic groups and the recent movement of alleles into new environments, both of which are relevant to studies of genetic epidemiology.DOI: http://dx.doi.org/10.7554/eLife.15266.001
We conducted a genome-wide association study of host resistance to severe Plasmodium falciparum malaria in over 17,000 individuals from 11 malariaendemic countries, undertaking a wide ranging analysis which identifies five replicable associations with genome-wide levels of evidence. Our findings include a newly implicated variant on chromosome 6 associated with risk of cerebral malaria, and the discovery of an erythroid-specific transcription start site underlying the association in ATP2B4. Previously reported HLA associations cannot be replicated in this dataset. We estimate substantial heritability of severe malaria (h 2 ~ 23%), of which around 10% is explained by the currently identified associations. Our dataset will provide a major building block for future research on the genetic determinants of disease in these diverse human populations.a single set of estimated haplotypes for 17,960 individuals at the set of over 1.5M SNPs genome-wide that passed our quality control process (Methods). This dataset includes 6,888 individuals from Mali, Burkina Faso, Ghana, Nigeria, Cameroon, Tanzania, Vietnam and Papua New Guinea that have not previously been included in meta-analysis, as well as previously reported data from The Gambia, Malawi and Kenya 9,10 , and thus reflects the haplotype diversity of a substantial portion of the malaria-endemic world.A reference panel enriched for African DNA improves imputation accuracy across the genome The ethnically diverse nature of our study provides challenges for genomic inference, including for our ability to impute genotypes at potentially relevant untyped loci 11 . To address this, we sequenced the genomes of 773 individuals from ten ethnic groups in east and west Africa (specifically from the Gambia, Burkina Faso, Cameroon and Tanzania), including 207 family trios (Figure 1). We combined genotypes at single nucleotide polymorphisms (SNPs) in these data with Phase 3 of the 1000 Genomes Project to form an imputation reference panel which covers most common genetic variation 12 and in which two-fifths of the donor families are of African ancestry (1,203 of 3,046 individuals). In principle, the additional representation of African DNA in this panel should lead to improvements in imputation accuracy for African study populations, and we found that this was indeed the case, with use of our panel leading to a large increase in accuracy relative to panels used in our previous GWAS 9-11 and a more modest improvement relative to using the 1000 Genomes Phase 3 panel alone (Figure S1). Imputation of Vietnamese individuals, and those from Papua New Guinea (PNG) which is substantially diverged from any reference panel population (Figure 1d), were less affected by the inclusion of these additional haplotypes.We specifically examined imputation of malaria-protective alleles in the HBB gene, which have previously been found difficult to impute 9,11 . The SNP encoding the sickle cell mutation (rs334, chr11:5248232) was imputed with r>0.9 in all African populations, as compared with genotype...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.