2014
DOI: 10.1101/007427
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Butter: High-precision genomic alignment of small RNA-seq data

Abstract: Eukaryotes produce large numbers of small non-coding RNAs that act as specificity determinants for various gene-regulatory complexes. These include microRNAs (miRNAs), endogenous short interfering RNAs (siRNAs), and Piwi-associated RNAs (piRNAs). These RNAs can be discovered, annotated, and quantified using small RNA-seq, a variant RNA-seq method based on highly parallel sequencing. Alignment to a reference genome is a critical step in analysis of small RNA-seq data. Because of their small size (20-30 nts depe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
16
0
1

Year Published

2015
2015
2019
2019

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 18 publications
(17 citation statements)
references
References 20 publications
0
16
0
1
Order By: Relevance
“…Bowtie1 with default settings demonstrated poor accuracy in most tests and provided the third lowest accuracy overall. A recent report has highlighted considerable strand-bias in Bowtie1 when using the default settings and cautioned that this is likely to have negatively impacted previously published analyses (Axtell 2014). In contrast, Bowtie1 using the "best strata" setting was highly accurate in our tests (including multimapped reads), and is used in microRNA prediction pipelines such as miRDeep2 and microRNA target prediction (StarBase) (Friedländer et al 2008;Hackenberg et al 2011).…”
Section: Discussionmentioning
confidence: 72%
“…Bowtie1 with default settings demonstrated poor accuracy in most tests and provided the third lowest accuracy overall. A recent report has highlighted considerable strand-bias in Bowtie1 when using the default settings and cautioned that this is likely to have negatively impacted previously published analyses (Axtell 2014). In contrast, Bowtie1 using the "best strata" setting was highly accurate in our tests (including multimapped reads), and is used in microRNA prediction pipelines such as miRDeep2 and microRNA target prediction (StarBase) (Friedländer et al 2008;Hackenberg et al 2011).…”
Section: Discussionmentioning
confidence: 72%
“…In these latter interactions, activity of the silencing-suppressor proteins P19 and P38, which, respectively, sequesters 21 nt duplex siRNAs (Vargason et al, 2003), and blocks AtDCL4 activity (Azevedo et al, 2010), could determine the observed high accumulation of 22 nt vsRNAs. In this context, a specific activity and/or differential behavior of PolRSV NSs in the two hosts could have a role in determining the different vsRNA profile.…”
Section: The Vsrna Size Profile Is Different In the Two Hostsmentioning
confidence: 99%
“…Clean reads were used for mapping small RNA sequences against the viral (GenBank accession numbers: KJ575619 for the L segment, HQ830188 for the M segment, HQ830187 for the S segment, Margaria et al, 2014), and host genomes (N. benthamiana genome version 0.5, and tomato genome version 2.50) (Naim et al, 2012;Tomato Genome Consortium, 2012), using butter version 0.3.3 (Axtell, 2014), allowing zero mismatches. Read size, distribution along the genomic segments and along single ORFs, polarity, 5 -nt enrichment and hotspot analyses were performed using samtools version 1.0 (Li et al, 2009) and in-house perl scripts, and imaged using Microsoft Excel ® v. 10, exactly as previously described (Margaria et al, 2015a).…”
Section: Bioinformatic Analysis Of Small Rna Sequencesmentioning
confidence: 99%
“…The following Genbank accessions were used for isolate p202/3WT (KJ575619 for the L segment, HQ830188 for the M segment, HQ830187 for the S segment), and p202/3RB (KJ575619 for the L segment, HQ830185 for the M segment, HQ830186 for the S segment). Alignment against plant host genome was also performed using butter (version 0.2.5) (Axtell, 2014), using N. benthamiana genome version 0.5 (Naim et al, 2012) allowing zero mismatches. Reads size and distribution along the viral genome were determined using samtools (version 1.0) (Li et al, 2009) and custom perl scripts.…”
Section: Bioinformatic Analysismentioning
confidence: 99%
“…Adapter sequences (5 adapter sequence: TTCAGAGTTCTACAGTCCGACGATC, 3 adapter sequence: TCGTATGCCGTCTTCTGCTTG) were removed using a custom BGI script, allowing four mismatches. Trimmed reads were aligned against the viral genomic sequences deposited in GenBank (see below) using butter (version 0.2.5) (Axtell, 2014), allowing zero mismatches. The sequences of the three genomic segments of isolate p105 (Genbank accessions KJ575620 for the L segment, KJ575620 for the M segment, DQ376178 for the S segment), were used for the analysis of the WT and the derived mutant strain p105RBMar.…”
Section: Bioinformatic Analysismentioning
confidence: 99%