2017
DOI: 10.1186/s13059-017-1364-2
|View full text |Cite
|
Sign up to set email alerts
|

PureCLIP: capturing target-specific protein–RNA interaction footprints from single-nucleotide CLIP-seq data

Abstract: The iCLIP and eCLIP techniques facilitate the detection of protein–RNA interaction sites at high resolution, based on diagnostic events at crosslink sites. However, previous methods do not explicitly model the specifics of iCLIP and eCLIP truncation patterns and possible biases. We developed PureCLIP (https://github.com/skrakau/PureCLIP), a hidden Markov model based approach, which simultaneously performs peak-calling and individual crosslink site detection. It explicitly incorporates a non-specific background… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
111
1

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 131 publications
(121 citation statements)
references
References 34 publications
0
111
1
Order By: Relevance
“…While the top 6mers in the high ranking sites are in line with the corresponding RBP motif(s), this is not necessarily the case for low ranking sites specially for RBPs with low scores. As an example for CPSF6 with AP scores of 0.26 the high ranking binding sites contains mostly AAUAAA and UGUA elements while the low ranking sites are enriched in U-rich elements that have been previously reported as CLIP artifacts [Krakau et al, 2017]. This suggests that the RBP data used in our study may vary considerably in their quality, and indicates a potentially high rate of false positives in some of the (PAR-)CLIP datasets.…”
Section: Performancementioning
confidence: 63%
“…While the top 6mers in the high ranking sites are in line with the corresponding RBP motif(s), this is not necessarily the case for low ranking sites specially for RBPs with low scores. As an example for CPSF6 with AP scores of 0.26 the high ranking binding sites contains mostly AAUAAA and UGUA elements while the low ranking sites are enriched in U-rich elements that have been previously reported as CLIP artifacts [Krakau et al, 2017]. This suggests that the RBP data used in our study may vary considerably in their quality, and indicates a potentially high rate of false positives in some of the (PAR-)CLIP datasets.…”
Section: Performancementioning
confidence: 63%
“…As an example, the high-ranking binding sites for CPSF6 (AP of 0.26) contain mostly AAUAAA and UGUA elements, that is, the polyadenylation signal and up-stream motif recognized by the CFIm complex that CPSF6 is part of (Martin et al 2012). Low ranking CPSF6 sites are enriched in U-rich elements that have been previously reported as CLIP artifacts (Krakau et al 2017). This indicates that the RBP data used in our study vary in terms of the fraction of sequence-specific sites in them, indicating a potentially high rate of false positives in some of the (PAR-)CLIP data sets or, alternatively, specification of sites by features not accounted for in our DNNs.…”
Section: Performance Of Deepripementioning
confidence: 85%
“…Besides the expression profile, matched IgG CLIP results have been used often to estimate background control, but they were unfavorably sparse and artificially over-amplified ( 31 ). Although sequencing of a size-matched input control prior to immunoprecipitation is informative to improve the peak calling process, shown in the eCLIP ( 31 ) and an HMM-based peak calling method with individual crosslink site detection (PureCLIP) ( 44 ), performing such matched experiments with CLIPper is neither always feasible nor so much beneficial in terms of accuracy and sensitivity as show in the comparison (Figure 5 and Supplementary Figure S8 ). This is likely due to relatively superior performance of CLIPick in refining peak signals from lowly expressed transcripts (Figure 5D and Supplementary Figure S9 ).…”
Section: Discussionmentioning
confidence: 99%