“…12,[31][32][33] The data set includes 39 populations from the major language groups: Khoisan (!Kung1, !Kung2, Khwe, Hadza), Nilo-Saharan (Kanuri, Songhai, Turkana, Nubian, Sudanese, Mbuti, Datoga), Afroasiatic (Moroccan Berber, non-Berber Moroccan, Egyptian, Algerian Mozabite, Tuareg, Somalian, Amhara, Hausa, Podokwo, Mandara, Uldeme, Iraqw), Niger-Congo non-Bantu (Fulbe = Fulfulde, Yoruba, Serer, Wolof, Mandinka, Tupuri), and Niger-Congo Bantu (Bubi, Fang, Biaka, Kikuyu, Mozambique1, Mozambique2, Bakaka, Bassa, Mbenzele, Sukuma) (Figure 1). Some populations represented in the original data sets [12,31-33] were omitted because they are not found on the African mainland, are Cameroonian populations not represented in the Y chromosome data set [33], or because linguistic designations could not be inferred.…”