2015
DOI: 10.1016/j.cell.2015.09.054
|View full text |Cite
|
Sign up to set email alerts
|

Learning the Sequence Determinants of Alternative Splicing from Millions of Random Sequences

Abstract: Most human transcripts are alternatively spliced, and many disease-causing mutations affect RNA splicing. Toward better modeling the sequence determinants of alternative splicing, we measured the splicing patterns of over two million (M) synthetic mini-genes, which include degenerate subsequences totaling over 100 M bases of variation. The massive size of these training data allowed us to improve upon current models of splicing, as well as to gain new mechanistic insights. Our results show that the vast majori… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

16
313
4
1

Year Published

2016
2016
2021
2021

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 252 publications
(334 citation statements)
references
References 46 publications
16
313
4
1
Order By: Relevance
“…′ splice sites, which has been previously shown to affect splicing efficiency (Rosenberg et al 2015). These distances were greater in lincRNAs than in mRNAs, as gauged both by in silico mapping the canonical branch motif ( Fig.…”
Section: Wwwgenomeorgmentioning
confidence: 81%
“…′ splice sites, which has been previously shown to affect splicing efficiency (Rosenberg et al 2015). These distances were greater in lincRNAs than in mRNAs, as gauged both by in silico mapping the canonical branch motif ( Fig.…”
Section: Wwwgenomeorgmentioning
confidence: 81%
“…MAVEs for splicing have already been implemented for both direct genome editing 15 and minigene assays. 14,47,48 MAVEs have also been developed to probe the effects of variants in untranslated regions of mRNAs on message stability and protein expression. [16][17][18] Variation in transcriptional regulatory elements is also important, and pathogenic regulatory variants have been identified for a number of Mendelian disorders.…”
Section: Annotating Every Possible Variant In Disease-related Functiomentioning
confidence: 99%
“…For example, massively parallel reporter assays (MPRAs) query the effects of regulatory DNA variants on the expression of reporter genes, 13 whereas splicing assays reveal variant effects on mRNA processing. 14,15 The effect of variants on mRNA stability and translation can also be measured by multiplex assays. [16][17][18] Finally, deep mutational scans query the effect of amino acid substitutions on protein function.…”
Section: Introductionmentioning
confidence: 99%
“…In humans, genetic and biochemical studies show that exons are first recognized in a process called exon definition, and then introns between them are removed [7][8][9][10][11] . The major exon recognition elements, including the splice donor, acceptor, branchpoint and polypyrimidine tract, taken together are too degenerate alone to discriminate true exons from those not utilized in vivo [12][13][14] .…”
Section: Main Textmentioning
confidence: 99%
“…These sequences are short motifs that are broadly classified as exonic splicing enhancers (ESEs) and suppressors (ESSs) as well as their intronic counterparts 16 , 17 (ISEs & ISSs). Machine learning methods use these and other genomic features trained against genome-wide RNA sequencing datasets to build predictive models of splicing regulation [7][8][9] . However, the predictive power of these models may come almost entirely from sequence conservation rather than the mechanistic understanding of splicing 18,19 .…”
Section: Main Textmentioning
confidence: 99%