We describe the details of a serial analysis of gene expression (SAGE) library construction and analysis platform that has enabled the generation of >298 high-quality SAGE libraries and >30 million SAGE tags primarily from sub-microgram amounts of total RNA purified from samples acquired by microdissection. Several RNA isolation methods were used to handle the diversity of samples processed, and various measures were applied to minimize ditag PCR carryover contamination. Modifications in the SAGE protocol resulted in improved cloning and DNA sequencing efficiencies. Bioinformatic measures to automatically assess DNA sequencing results were implemented to analyze the integrity of ditag structure, linker or cross-species ditag contamination, and yield of high-quality tags per sequence read. Our analysis of singleton tag errors resulted in a method for correcting such errors to statistically determine tag accuracy. From the libraries generated, we produced an essentially complete mapping of reliable 21-base-pair tags to the mouse reference genome sequence for a meta-library of ∼5 million tags. Our analyses led us to reject the commonly held notion that duplicate ditags are artifacts. Rather than the usual practice of discarding such tags, we conclude that they should be retained to avoid introducing bias into the results and thereby maintain the quantitative nature of the data, which is a major theoretical advantage of SAGE as a tool for global transcriptional profiling.[Supplemental material is available online at www.genome.org.]Serial analysis of gene expression (SAGE) offers a particularly attractive technology for profiling eukaryotic transcriptomes (Velculescu et al. 1995) because of the digital and quantitative nature of the data, its efficient sampling of short sequence tags from known and novel mRNA transcripts, and its theoretically unlimited dynamic range. Numerous improvements to the original technology have been described (Peters et al. 1999;Saha et al. 2002;Gowda et al. 2004;Heidenblut et al. 2004;Wei et al. 2004;Kodzius et al. 2006). These include the production of longer tags, which have improved the specificity of tag-to-gene mapping (Saha et al. 2002;Matsumura et al. 2003), and modifications designed to facilitate library construction from nanogram quantities of total RNA (Peters et al. 1999;Neilson et al. 2000). Recently, the use of SAGE-like procedures to identify regions of the genome interacting with DNA-binding proteins has been described (Impey et al. 2004;Chen and Sadowski 2005;Kim et al. 2005;Loh et al. 2006;Wei et al. 2006). Such approaches represent viable alternatives to ChIP-on-chip (Ren et al. 2000).SAGE is among the few relatively accessible digital gene expression profiling technologies capable of generating comprehensive transcriptome profiles. Nevertheless, challenges associated with laborious library construction and generally limited access to inexpensive automated DNA sequencing have restricted its application to large-scale initiatives. Experiments at our Genome Center ) and e...
To facilitate discovery of novel human embryonic stem cell (ESC) transcripts, we generated 2.5 million LongSAGE tags from 9 human ESC lines. Analysis of this data revealed that ESCs express proportionately more RNA binding proteins compared with terminally differentiated cells, and identified novel ESC transcripts, at least one of which may represent a marker of the pluripotent state.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.