Arab populations are largely understudied, notably their genetic structure and history. Here we present an in-depth analysis of 6,218 whole genomes from Qatar, revealing extensive diversity as well as genetic ancestries representing the main founding Arab genealogical lineages of Qahtanite (Peninsular Arabs) and Adnanite (General Arabs and West Eurasian Arabs). We find that Peninsular Arabs are the closest relatives of ancient hunter-gatherers and Neolithic farmers from the Levant, and that founder Arab populations experienced multiple splitting events 12–20 kya, consistent with the aridification of Arabia and farming in the Levant, giving rise to settler and nomadic communities. In terms of recent gene flow, we show that these ancestries contributed significantly to European, South Asian and South American populations, likely as a result of Islamic expansion over the past 1,400 years. Notably, we characterize a large cohort of men with the ChrY J1a2b haplogroup (n = 1,491), identifying 29 unique sub-haplogroups. Finally, we leverage genotype novelty to build a reference panel of 12,432 haplotypes, demonstrating improved genotype imputation for both rare and common alleles in Arabs and the wider Middle East.
In a clinical setting, DNA sequencing can uncover findings unrelated to the purpose of genetic evaluation. The American College of Medical Genetics and Genomics (ACMG) recommends the evaluation and reporting of 59 genes from clinical genomic sequencing. While the prevalence of secondary findings is available from large population studies, these data lack Arab and other Middle Eastern populations. The Qatar Genome Program (QGP) generates whole-genome sequencing (WGS) data and combines it with phenotypic information to create a comprehensive database for studying the Qatari and wider Arab and Middle Eastern populations at the molecular level. This study identified and analyzed medically actionable variants in the 59 ACMG genes using WGS data from 6,045 QGP participants. Our results identified a total of 60 pathogenic and likely pathogenic variants in 25 ACMG genes in 141 unique individuals. Overall, 2.3% of the sequenced QGP participants carried a pathogenic or likely pathogenic variant in one of the 59 ACMG genes. We also evaluated phenotype-genotype associations in QGP for additional nonpathogenic ACMG variants. These variants had been found in patients from the Hamad Medical Corporation or in reported incidental-findings data in Qatar. We found a significant phenotype association for two variants, c.313+3A>C in LDLR and c.58C>T (p.Gln20*) in TPM1.
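The 2.3% carrier figure follows directly from the two counts given in the abstract (141 carriers among 6,045 participants). The sketch below reproduces that arithmetic; the function names are hypothetical, and the Wilson 95% interval is an illustrative addition that the abstract does not report.

```python
import math


def carrier_prevalence(carriers: int, participants: int) -> float:
    """Fraction of participants carrying at least one qualifying variant."""
    if participants <= 0:
        raise ValueError("participants must be positive")
    return carriers / participants


def wilson_interval(carriers: int, participants: int, z: float = 1.96):
    """Wilson score interval for a binomial proportion.

    Illustrative only: the abstract gives the point estimate, not an interval.
    """
    p = carriers / participants
    denom = 1 + z**2 / participants
    centre = (p + z**2 / (2 * participants)) / denom
    half = z * math.sqrt(
        p * (1 - p) / participants + z**2 / (4 * participants**2)
    ) / denom
    return centre - half, centre + half


# Counts taken from the abstract.
prevalence = carrier_prevalence(141, 6045)
low, high = wilson_interval(141, 6045)

print(f"{prevalence:.1%}")  # prints 2.3%
print(f"{low:.2%} to {high:.2%}")
```

The interval only conveys how precisely a cohort of this size pins down the carrier rate; it makes no claim about how the study's authors quantified uncertainty.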
Lung-function impairment underlies chronic obstructive pulmonary disease (COPD) and predicts mortality. In the largest multi-ancestry genome-wide association meta-analysis of lung function to date, comprising 580,869 participants, we identified 1,020 independent association signals implicating 559 genes supported by ≥2 criteria from a systematic variant-to-gene mapping framework. These genes were enriched in 29 pathways. Individual variants showed heterogeneity across ancestries, age and smoking groups, and collectively as a genetic risk score showed strong association with COPD across ancestry groups. We undertook phenome-wide association studies for selected associated variants as well as trait and pathway-specific genetic risk scores to infer possible consequences of intervening in pathways underlying lung function. We highlight new putative causal variants, genes, proteins and pathways, including those targeted by existing drugs. These findings bring us closer to understanding the mechanisms underlying lung function and COPD, and should inform functional genomics experiments and potentially future COPD therapies.