Although Slavic populations account for over 4.5% of world inhabitants, no centralised, open-source reference database of genetic variation of any Slavic population exists to date. Such data are crucial for clinical genetics, biomedical research, as well as archeological and historical studies. The Polish population, which is homogenous and sedentary in its nature but influenced by many migrations of the past, is unique and could serve as a genetic reference for the Slavic nations. In this study, we analysed whole genomes of 1222 Poles to identify and genotype a wide spectrum of genomic variation, such as small and structural variants, runs of homozygosity, mitochondrial haplogroups, and de novo variants. Common variant analyses showed that the Polish cohort is highly homogenous and shares ancestry with other European populations. In rare variant analyses, we identified 32 autosomal-recessive genes with significantly different frequencies of pathogenic alleles in the Polish population as compared to the non-Finish Europeans, including C2, TGM5, NUP93, C19orf12, and PROP1. The allele frequencies for small and structural variants, calculated for 1076 unrelated individuals, are released publicly as The Thousand Polish Genomes database, and will contribute to the worldwide genomic resources available to researchers and clinicians.
Background Genetic causes that lead to spermatogenetic failure in patients with nonobstructive azoospermia (NOA) have not been yet completely established. Objective To identify low‐frequency NOA‐associated single nucleotide variants (SNVs) using whole‐genome sequencing (WGS). Materials and methods Men with various types of NOA (n = 39), including samples that had been previously tested with whole‐exome sequencing (WES; n = 6) and did not result in diagnostic conclusions. Variants were annotated using the Ensembl Variant Effect Predictor, utilizing frequencies from GnomAD and other databases to provide clinically relevant information (ClinVar), conservation scores (phyloP), and effect predictions (i.e., MutationTaster). Structural protein modeling was also performed. Results Using WGS, we revealed potential NOA‐associated SNVs, such as: TKTL1, IGSF1, ZFPM2, VCX3A (novel disease causing variants), ESX1, TEX13A, TEX14, DNAH1, FANCM, QRICH2, FSIP2, USP9Y, PMFBP1, MEI1, PIWIL1, WDR66, ZFX, KCND1, KIAA1210, DHRSX, ZMYM3, FAM47C, FANCB, FAM50B (genes previously known to be associated with infertility) and ALG13, BEND2, BRWD3, DDX53, TAF4, FAM47B, FAM9B, FAM9C, MAGEB6, MAP3K15, RBMXL3, SSX3 and FMR1NB genes, which may be involved in spermatogenesis. Discussion and conclusion In this study, we identified novel potential candidate NOA‐associated genes in 29 individuals out of 39 azoospermic males. Note that in 5 out of 6 patients subjected previously to WES analysis, which did not disclose potentially causative variants, the WGS analysis was successful with NOA‐associated gene findings.
Although Slavic populations account for over 3.5% of world inhabitants, no centralized, open source reference database of genetic variation of any Slavic population exists to date. Such data are crucial for either biomedical research and genetic counseling and are essential for archeological and historical studies. Polish population, homogenous and sedentary in its nature but influenced by many migrations of the past, is unique and could serve as a good genetic reference for middle European Slavic nations.The aim of the present study was to describe first results of analyses of a newly created national database of Polish genomic variant allele frequencies. Never before has any study on the whole genomes of Polish population been conducted on such a large number of individuals (1,079).A wide spectrum of genomic variation was identified and genotyped, such as small and structural variants, runs of homozygosity, mitochondrial haplogroups and Mendelian inconsistencies. The allele frequencies were calculated for 943 unrelated individuals and released publicly as The Thousand Polish Genomes database. A precise detection and characterisation of rare variants enriched in the Polish population allowed to confirm the allele frequencies for known pathogenic variants in diseases, such as Smith-Lemli-Opitz syndrome (SLOS) or Nijmegen breakage syndrome (NBS). Additionally, the analysis of OMIM AR genes led to the identification of 22 genes with significantly different cumulative allele frequencies in the Polish (POL) vs European NFE population. We hope that The Thousand Polish Genomes database will contribute to the worldwide genomic data resources for researchers and clinicians.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.