Proteomic variation between individuals has immense potential for identifying novel drug targets and disease mechanisms. However, with high-throughput proteomic technologies still in their infancy, they have largely been applied in large majority European ancestry cohorts (e.g. the UK Biobank). An open question is the degree to which proteomic signatures seen in European and other groups mirror those seen in diverse populations, such as cohorts from Africa. Coupled with genetic information, we can also gain a better understanding of the role of genetic variants in the regulation of the proteome and subsequent disease mechanisms. To address the gap in our understanding of proteomic variation in individuals of African ancestry, we collected proteomic data from 176 individuals across two ethnic groups (Igbo and Yoruba) in Nigeria. These individuals were also stratified into high BMI (BMI > 30 kg/m2) and normal BMI (20 kg/m2< BMI < 30 kg/m2) categories. We characterized differences in plasma protein abundance using the Olink Explore 1536 panel between high and normal BMI individuals, finding strong associations consistent with previously known signals in individuals of European descent. We additionally found 73 sentinel cis-pQTL in this dataset, with 21 lead cis-pQTL not observed in catalogs of variation from European-ancestry individuals. In summary, our study highlights the value of leveraging proteomic data in cohorts of diverse ancestry for investigating trait-specific mechanisms and discovering novel genetic regulators of the plasma proteome.
African populations have been drastically underrepresented in genomics research and failure to capture the genetic diversity across the numerous ethnolinguistic groups (ELGs) found in the continent has hindered the equity of precision medicine initiatives globally. Here, we describe the whole-genome sequencing of 449 Nigerian individuals across 47 unique self-reported ELGs. Population structure analysis reveals genetic differentiation amongst our ELGs, consistent with previous findings. From the 36 million SNPs and indels discovered in our dataset, we provide a high-level catalog of both novel and medically-relevant variation present across the ELGs. These results emphasize the value of this resource for genomics research, with added granularity by representing multiple ELGs from Nigeria. Our results also underscore the potential of using these cohorts with larger sample sizes to improve our understanding of human ancestry and health in Africa.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.