Plant endogenous small RNAs (sRNAs) are important regulators of gene expression. There are two broad categories of plant sRNAs: microRNAs (miRNAs) and endogenous short interfering RNAs (siRNAs). MicroRNA loci are relatively well-annotated but compose only a small minority of the total sRNA pool; siRNA locus annotations have lagged far behind. Here, we used a large data set of published and newly generated sRNA sequencing data (1333 sRNA-seq libraries containing more than 20 billion reads) and a uniform bioinformatic pipeline to produce comprehensive sRNA locus annotations of 47 diverse plants, yielding more than 2.7 million sRNA loci. The two most numerous classes of siRNA loci produced mainly 24-and 21-nucleotide (nt) siRNAs, respectively. Most often, 24-nt-dominated siRNA loci occurred in intergenic regions, especially at the 5 ′-flanking regions of protein-coding genes. In contrast, 21-nt-dominated siRNA loci were most often derived from double-stranded RNA precursors copied from spliced mRNAs. Genic 21-nt-dominated loci were especially common from disease resistance genes, including from a large number of monocots. Individual siRNA sequences of all types showed very little conservation across species, whereas mature miRNAs were more likely to be conserved. We developed a web server where our data and several search and analysis tools are freely accessible.
22Plant endogenous small RNAs (sRNAs) are important regulators of gene expression. 23There are two broad categories of plant sRNAs: microRNAs (miRNAs) and endogenous short 24interfering RNAs (siRNAs). MicroRNA loci are relatively well-annotated but comprise only a 25 small minority of the total sRNA pool; siRNA locus annotations have lagged far behind. Here, we 26 used a large dataset of published and newly generated sRNA sequencing data (1,333 sRNA-seq 27 libraries containing over 20 billion reads) and a uniform bioinformatic pipeline to produce 28 comprehensive sRNA locus annotations of 47 diverse plants, yielding over 2.7 million sRNA loci. 29The two most numerous classes of siRNA loci produced mainly 24 nucleotide and 21 nucleotide 30 siRNAs, respectively. 24 nucleotide-dominated siRNA loci usually occurred in intergenic regions, 31 especially at the 5'-flanking regions of protein-coding genes. In contrast, 21 nucleotide-32 dominated siRNA loci were most often derived from double-stranded RNA precursors copied 33 from spliced mRNAs. Genic 21 nucleotide-dominated loci were especially common from disease 34 resistance genes, including from a large number of monocots. Individual siRNA sequences of all 35 types showed very little conservation across species, while mature miRNAs were more likely to 36 be conserved. We developed a web server where our data and several search and analysis tools 37 are freely accessible at
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.