Small RNA-seq is increasingly being used for profiling of small RNAs. Quantitative characteristics of long RNA-seq have been extensively described, but small RNA-seq involves fundamentally different methods for library preparation, with distinct protocols and technical variations that have not been fully and systematically studied. We report here the results of a study using common references (synthetic RNA pools of defined composition, as well as plasma-derived RNA) to evaluate the accuracy, reproducibility and bias of small RNA-seq library preparation for five distinct protocols and across nine different laboratories. We observed protocol-specific and sequence-specific bias, which was ameliorated using adapters for ligation with randomized end-nucleotides, and computational correction factors. Despite this technical bias, relative quantification using small RNA-seq was remarkably accurate and reproducible, even across multiple laboratories using different methods. These results provide strong evidence for the feasibility of reproducible cross-laboratory small RNA-seq studies, even those involving analysis of data generated using different protocols.
(Introduction without separate heading below)RNA-seq using next generation sequencing has been a transformative technology that has been widely used as a method for characterizing the transcriptome in a wide range of biological contexts 1,2 . Applications of RNA-seq fall into two categories: long RNA-seq and small RNA-seq, distinguished not only by the size of the targeted RNAs, but also by the technical methods used and the resulting biases of the different approaches in the quantitative data produced 3 . For example, the production of the libraries for long RNA-seq, by virtue of having sufficiently long target RNA lengths, commonly utilizes primers (e.g., random primers or oligo-dT) for direct generation of cDNA from RNA. However, small RNA-seq library construction methods typically require an RNA ligation or polyA tailing step to overcome the challenge of performing reverse transcription and subsequent PCR-based amplification from extremely short (e.g., 16-30 nt) target RNA sequences.Multiple approaches have been developed to overcome the challenge of uniformly and robustly generating cDNA from small RNAs for the purpose of small RNA-seq library preparation 4-9 . Protocols in use for small RNA-seq therefore vary more widely than those used for long RNA-seq, creating significantly more potential for variation across small RNA-seq results using different library preparation protocols and by different labs. In addition, small RNA-seq is increasingly used to study small RNAs present in very low input concentration samples (e.g., in exosomes and other types of extracellular vesicles (EV) 10-19 , or in RNA-protein complexes present in biofluids 20-26 ). Normalization methods 27-29 developed to correct for variation in long RNA-seq data are typically not well-suited for such small RNA-seq data, making it even more important to
This protocol describes a library preparation method for sequencing small RNA. The method uses degenerate adapters to alleviate the problem of bias in the ligation steps during small RNA library preparation, and it optimizes several other parameters to make it appropriate for use specifically with plasma RNA.
This protocol describes a library preparation method for sequencing small RNA. The method uses degenerate adapters to alleviate the problem of bias in the ligation steps during small RNA library preparation, and it optimizes several other parameters to make it appropriate for use specifically with plasma RNA.
This protocol describes a library preparation method for sequencing small RNA. The method uses degenerate adapters to alleviate the problem of bias in the ligation steps during small RNA library preparation, and it optimizes several other parameters to make it appropriate for use specifically with plasma RNA.
This protocol describes a library preparation method for sequencing small RNA. The method uses degenerate adapters to alleviate the problem of bias in the ligation steps during small RNA library preparation, and it optimizes several other parameters to make it appropriate for use specifically with plasma RNA.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.