Massively parallel sequencing instruments enable rapid and inexpensive DNA sequence data production. Because these instruments are new, their data require characterization with respect to accuracy and utility. To address this, we sequenced a Caernohabditis elegans N2 Bristol strain isolate using the Solexa Sequence Analyzer, and compared the reads to the reference genome to characterize the data and to evaluate coverage and representation. Massively parallel sequencing facilitates strain-to-reference comparison for genome-wide sequence variant discovery. Owing to the short-read-length sequences produced, we developed a revised approach to determine the regions of the genome to which short reads could be uniquely mapped. We then aligned Solexa reads from C. elegans strain CB4858 to the reference, and screened for single-nucleotide polymorphisms (SNPs) and small indels. This study demonstrates the utility of massively parallel short read sequencing for whole genome resequencing and for accurate discovery of genome-wide polymorphisms.
SUMMARY
Epithelial stem cells self-renew while maintaining multipotency, but the dependence of stem cell properties on maintenance of the epithelial phenotype is unclear. We previously showed that trophoblast stem (TS) cells lacking the protein kinase MAP3K4 maintain properties of both stemness and epithelial-mesenchymal transition (EMT). Here, we show that MAP3K4 controls the activity of the histone acetyltransferase CBP, and that acetylation of histones H2A and H2B by CBP is required to maintain the epithelial phenotype. Combined loss of MAP3K4/CBP activity represses expression of epithelial genes and causes TS cells to undergo EMT while maintaining their self-renewal and multipotency properties. The expression profile of MAP3K4 deficient TS cells defines an H2B acetylation regulated gene signature that closely overlaps with that of human breast cancer cells. Taken together, our data define an epigenetic switch that maintains the epithelial phenotype in TS cells and reveal previously unrecognized genes potentially contributing to breast cancer.
We introduce a web-based tool, Peak Annotation and Visualization (PAVIS), for annotating and visualizing ChIP-seq peak data. PAVIS is designed with non-bioinformaticians in mind and presents a straightforward user interface to facilitate biological interpretation of ChIP-seq peak or other genomic enrichment data. PAVIS, through association with annotation, provides relevant genomic context for each peak, such as peak location relative to genomic features including transcription start site, intron, exon or 5'/3'-untranslated region. PAVIS reports the relative enrichment P-values of peaks in these functionally distinct categories, and provides a summary plot of the relative proportion of peaks in each category. PAVIS, unlike many other resources, provides a peak-oriented annotation and visualization system, allowing dynamic visualization of tens to hundreds of loci from one or more ChIP-seq experiments, simultaneously. PAVIS enables rapid, and easy examination and cross-comparison of the genomic context and potential functions of the underlying genomic elements, thus supporting downstream hypothesis generation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.