2022
DOI: 10.1371/journal.pgen.1010191
|View full text |Cite
|
Sign up to set email alerts
|

Classification of non-coding variants with high pathogenic impact

Abstract: Whole genome sequencing is increasingly used to diagnose medical conditions of genetic origin. While both coding and non-coding DNA variants contribute to a wide range of diseases, most patients who receive a WGS-based diagnosis today harbour a protein-coding mutation. Functional interpretation and prioritization of non-coding variants represents a persistent challenge, and disease-causing non-coding variants remain largely unidentified. Depending on the disease, WGS fails to identify a candidate variant in 20… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
15
0
2

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 23 publications
(17 citation statements)
references
References 50 publications
0
15
0
2
Order By: Relevance
“…The method most comparable to the strategy followed by PSAP-genomic-regions is the recently-developed machine-learning algorithm FINSURF (42). FINSURF aims to predict the functional impact of non-coding variants in regulatory regions and has been applied to known pathogenic variants inserted in WGS data like we did.…”
Section: Discussionmentioning
confidence: 99%
“…The method most comparable to the strategy followed by PSAP-genomic-regions is the recently-developed machine-learning algorithm FINSURF (42). FINSURF aims to predict the functional impact of non-coding variants in regulatory regions and has been applied to known pathogenic variants inserted in WGS data like we did.…”
Section: Discussionmentioning
confidence: 99%
“…Over the last few years, it has become increasingly clear that the majority of disease‐associated mutations reside within noncoding regulatory enhancer regions (Claringbould et al, 2021; Kvon et al, 2020; Moyon et al, 2022). For instance, based on GWAS data, a SNP rs17640804 (C > T) in the GLI3 noncoding region (7q13, intron 3) has been associated with human face morphology, more specifically, with determining the nose wing breadth (Adhikari et al, 2016).…”
Section: Discussionmentioning
confidence: 99%
“…To assess for linkage disequilibrium between variants, r 2 values were calculated for each variant pair in this region (with a combined minimum case and control allele count of 3). Variants of interest were further annotated with CADD and FINSURF scores 13 , frequency in gnomADv3 East Asian genome sequences (n=2,604) and conservation of affected base across species using Ensembl v109 (Feb 2023). The frequency of the putative causal variant was also assessed in other genomic databases with individuals of Asian ancestry – the Genomics Thailand database of 10,043 individuals of Thai ancestry, the Singaporean SG10K_Health v5.3 database of individuals of Chinese, Indian and Malay ancestry 33 , the jMORP/54KJPN database of individuals of Japanese ancestry 34 and the KOVA database of individuals of Korean ancestry 35 .…”
Section: Methodsmentioning
confidence: 99%
“…The RE5 variant alters a nucleotide fully conserved across 35 mammalian species with available orthologous sequence data, in contrast to the other three variants (Fig. S1) and has higher computational deleteriousness scores from the CADD and FINSURF 13 algorithms (Table 2). The RE5 variant is also absent in the gnomADv3 database (n=76,165 genomes including n=2,604 of East Asian (EAS) ancestry), whereas the SCN10A -short promoter variant has a gnomAD-EAS frequency of 0.001, indicating it also exists on haplotypes distinct from the RE5 variant.…”
Section: Mainmentioning
confidence: 99%