Background
Population demography and gene flow among African groups, as well as the putative archaic introgression of ancient hominins, have been poorly explored at the genome level.
Results
Here, we examine 15 African populations covering all major continental linguistic groups, ecosystems, and lifestyles within Africa through analysis of whole-genome sequence data of 21 individuals sequenced at deep coverage. We observe a remarkable correlation among genetic diversity and geographic distance, with the hunter-gatherer groups being more genetically differentiated and having larger effective population sizes throughout most modern-human history. Admixture signals are found between neighbor populations from both hunter-gatherer and agriculturalists groups, whereas North African individuals are closely related to Eurasian populations. Regarding archaic gene flow, we test six complex demographic models that consider recent admixture as well as archaic introgression. We identify the fingerprint of an archaic introgression event in the sub-Saharan populations included in the models (~ 4.0% in Khoisan, ~ 4.3% in Mbuti Pygmies, and ~ 5.8% in Mandenka) from an early divergent and currently extinct ghost modern human lineage.
Conclusion
The present study represents an in-depth genomic analysis of a Pan African set of individuals, which emphasizes their complex relationships and demographic history at population level.
Electronic supplementary material
The online version of this article (10.1186/s13059-019-1684-5) contains supplementary material, which is available to authorized users.
Highlights d Paleolithic genetic continuity is found in extant human North African genomes d There is a West-to-East genetic cline of the Paleolithic component in North Africa d Neolithization had a larger demographic impact than Arabization d Differential admixture and genetic drift have modeled North African genomes
The analysis of the complete mtDNA genome has allowed for the identification of a North African sub-lineage that might be ignored by the analysis of partial mtDNA control region sequences, highlighting the phylogeographic relevance of mtDNA complete sequence analysis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.