2022
DOI: 10.1101/2022.10.11.22280955
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Repeat polymorphisms in non-coding DNA underlie top genetic risk loci for glaucoma and colorectal cancer

Abstract: Many regions in the human genome vary in length among individuals due to variable numbers of tandem repeats (VNTRs). We recently showed that protein-coding VNTRs underlie some of the strongest known genetic associations with diverse phenotypes. Here, we assessed the phenotypic impact of VNTRs genome-wide, 99% of which lie in non-coding regions. We applied a statistical imputation approach to estimate the lengths of 9,561 autosomal VNTR loci in 418,136 unrelated UK Biobank participants. Association and statisti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 8 publications
(9 citation statements)
references
References 71 publications
0
9
0
Order By: Relevance
“…This strategy proved fruitful: association analyses with 57 quantitative traits identified 375 associations at 99 loci not explainable by linkage disequilibrium (LD) to nearby SNPs (Supplementary Data 4), recovering strong VNTR-phenotype associations we recently reported (including a 39-bp coding repeat in GP1BA associated with platelet traits 35 ; P = 1.1 × 10 −133 ), and revealing several new loci involving multicopy variation poorly tagged by SNPs. Follow-up analyses of the most intriguing associations, detailed below, enabled further exploration of genetic variation at these loci and its influences on human health.…”
Section: Articlementioning
confidence: 54%
“…This strategy proved fruitful: association analyses with 57 quantitative traits identified 375 associations at 99 loci not explainable by linkage disequilibrium (LD) to nearby SNPs (Supplementary Data 4), recovering strong VNTR-phenotype associations we recently reported (including a 39-bp coding repeat in GP1BA associated with platelet traits 35 ; P = 1.1 × 10 −133 ), and revealing several new loci involving multicopy variation poorly tagged by SNPs. Follow-up analyses of the most intriguing associations, detailed below, enabled further exploration of genetic variation at these loci and its influences on human health.…”
Section: Articlementioning
confidence: 54%
“…58 Our findings pave the way for a novel approach to investigating the influence of genetic factors on brain biology. We, and others, [15][16][17]59 have demonstrated the power of TR-wide association studies to identify significant genetic effects independent of those discovered by SNP-focused GWAS. In particular, this work expands our understanding of the genetic architecture of subcortical volumes, underscoring the importance of considering repetitive elements in the broader landscape of brain genetics.…”
Section: Discussionmentioning
confidence: 88%
“…Exclusion of causal variants could in principle induce false-positive results at tagging variants that survive this filter; however, the functional enrichment of fine-mapped (PIP > 0.5) variants that are rare or low-frequency (MAF < 0.05) in at least one population is similar to that of variants that are common (MAF > 0.05) in both populations (2.85 vs 2.88, p = 0.91 for difference), suggesting that our exclusion criteria did not produce substantial false positives in practice. Second, as with all fine-mapping methods, exclusion of variants that are difficult to genotype, even in WGS data, (including tandem repeats [77][78][79] and other structural variants 80,81 ) could induce false-positive results at tagging variants. Indeed, at the HBA locus for mean corpuscular volume, SuSiE using Afr47k fine-mapped (at PIP > 0.5) 17 single nucleotide variants, 1 insertion, and 1 deletion (Supplementary Table 15); all fine-mapped variants were jointly significant in a multiple linear regression model.…”
Section: Discussionmentioning
confidence: 99%