2020
DOI: 10.1101/gr.247494.118
|View full text |Cite
|
Sign up to set email alerts
|

Deep neural networks for interpreting RNA-binding protein target preferences

Abstract: Deep learning has become a powerful paradigm to analyze the binding sites of regulatory factors including RNA-binding proteins (RBPs), owing to its strength to learn complex features from possibly multiple sources of raw data. However, the interpretability of these models, which is crucial to improve our understanding of RBP binding preferences and functions, has not yet been investigated in significant detail. We have designed a multitask and multimodal deep neural network for characterizing in vivo RBP targe… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
130
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 96 publications
(138 citation statements)
references
References 43 publications
0
130
0
Order By: Relevance
“…We curated a list of 205 genes of interest in B. malayi and D. immitis, consisting of known anthelmintic targets and receptors belonging to druggable protein families in parasitic of reads contained the canonical poly(A) signal (PAS) AAUAA, while A-rich regions were the second most abundant PAS. (C) All PASs occur 10-20 nucleotides upstream of the poly(A) cleavage site, other than AAAAA, which has two A-rich sites at 10-20 and 45-50 nucleotides upstream, and UAAUAA, which may not be a PAS but is a motif for RNA binding proteins [64,65]. (D) All PASs also include an A-rich region at 15 nucleotides upstream of the cleavage site.…”
Section: Improvement Of Gene Models For Known and Putative Anthelmintmentioning
confidence: 99%
“…We curated a list of 205 genes of interest in B. malayi and D. immitis, consisting of known anthelmintic targets and receptors belonging to druggable protein families in parasitic of reads contained the canonical poly(A) signal (PAS) AAUAA, while A-rich regions were the second most abundant PAS. (C) All PASs occur 10-20 nucleotides upstream of the poly(A) cleavage site, other than AAAAA, which has two A-rich sites at 10-20 and 45-50 nucleotides upstream, and UAAUAA, which may not be a PAS but is a motif for RNA binding proteins [64,65]. (D) All PASs also include an A-rich region at 15 nucleotides upstream of the cleavage site.…”
Section: Improvement Of Gene Models For Known and Putative Anthelmintmentioning
confidence: 99%
“…Finally, another contribution of our paper is to quantify the impact of and rectify a specific type of sequence bias present in certain CLIP-Seq datasets, originally identified by Kishore et al (2011) and later emphasized in Ghanbari and Ohler (2020) , which artificially inflated the reported prediction accuracy of several approaches published recently.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, deep neural networks (DNNs), predominantly based on convolutional neural networks (CNNs) or convolutionalrecurrent network hybrids, have emerged as a promising alternative, in most cases, improving prediction performance on held-out test data [13][14][15][16][17][18][19] . DNNs are a powerful class of models that can learn a functional mapping between input genomic sequences and experimentally measured labels, requiring minimal feature engineering [20][21][22] .…”
Section: Introductionmentioning
confidence: 99%
“…At the time, it demonstrated improved performance over PWM-and k-mer-based methods on the 2013-RNAcompete dataset, a standard benchmark dataset that consists of 244 in vitro affinity selection experiments that span across many RBP families 5 . Since then, other deep learning-based methods have emerged, further improving prediction performance on this dataset [23][24][25] and other CLIP-seq-based datasets 11,18,26,27 .…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation