2016
DOI: 10.1016/j.ajhg.2016.07.005
|View full text |Cite
|
Sign up to set email alerts
|

A Whole-Genome Analysis Framework for Effective Identification of Pathogenic Regulatory Variants in Mendelian Disease

Abstract: The interpretation of non-coding variants still constitutes a major challenge in the application of whole-genome sequencing in Mendelian disease, especially for single-nucleotide and other small non-coding variants. Here we present Genomiser, an analysis framework that is able not only to score the relevance of variation in the non-coding genome, but also to associate regulatory variants to specific Mendelian diseases. Genomiser scores variants through either existing methods such as CADD or a bespoke machine … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

5
269
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
7
1
1

Relationship

4
5

Authors

Journals

citations
Cited by 245 publications
(274 citation statements)
references
References 74 publications
5
269
0
Order By: Relevance
“…Scores for all possible SNVs genome-wide were downloaded from CADD 24 (http://cadd.gs.washington.edu/download), Genomiser 41 (https://charite.github.io/software-remm-score.html#download), and fathmm-MKL 42 (https://github.com/HAShihab/fathmm-MKL). …”
Section: Methodsmentioning
confidence: 99%
“…Scores for all possible SNVs genome-wide were downloaded from CADD 24 (http://cadd.gs.washington.edu/download), Genomiser 41 (https://charite.github.io/software-remm-score.html#download), and fathmm-MKL 42 (https://github.com/HAShihab/fathmm-MKL). …”
Section: Methodsmentioning
confidence: 99%
“…We subdivided the Mendelian data that include 406 manually annotated "positive" deleterious SNV and more than 14 millions of neutral "negative" SNVs in a training set including about 9/10 of the available data and a separated test set including the remaining 1/10 of data, using the same set of genomic features described in [Smedley et al, 2016]. We then compared the hyperSMURF results obtained by using the default parameters (i.e.…”
Section: Resultsmentioning
confidence: 99%
“…Indeed in this context classical machine learning methods are biased toward neutral variants that constitute the large majority of genetic variation, and are not able to detect the potential deleterious variants that constitute only a tiny minority of all known genetic variation [Smedley et al, 2016].…”
Section: Introductionmentioning
confidence: 99%
“…However, it should be noted that non-coding variant prioritization tools are less accurate than their protein-coding counterparts. Although many new approaches are being developed 39,94,95 , there is simply insufficient understanding of the regulatory machinery encrypted in non-coding DNA to prioritize non-coding variants with similar accuracy to that of coding variants 94 .…”
Section: Current Challenges and Emerging Solutionsmentioning
confidence: 99%