C57BL/6J (B6) and DBA/2J (D2) are two of the most commonly used inbred mouse strains in neuroscience research. However, the only currently available mouse genome is based entirely on the B6 strain sequence. Subsequently, oligonucleotide microarray probes are based solely on this B6 reference sequence, making their application for gene expression profiling comparisons across mouse strains dubious due to their allelic sequence differences, including single nucleotide polymorphisms (SNPs). The emergence of next-generation sequencing (NGS) and the RNA-Seq application provides a clear alternative to oligonucleotide arrays for detecting differential gene expression without the problems inherent to hybridization-based technologies. Using RNA-Seq, an average of 22 million short sequencing reads were generated per sample for 21 samples (10 B6 and 11 D2), and these reads were aligned to the mouse reference genome, allowing 16,183 Ensembl genes to be queried in striatum for both strains. To determine differential expression, ‘digital mRNA counting’ is applied based on reads that map to exons. The current study compares RNA-Seq (Illumina GA IIx) with two microarray platforms (Illumina MouseRef-8 v2.0 and Affymetrix MOE 430 2.0) to detect differential striatal gene expression between the B6 and D2 inbred mouse strains. We show that by using stringent data processing requirements differential expression as determined by RNA-Seq is concordant with both the Affymetrix and Illumina platforms in more instances than it is concordant with only a single platform, and that instances of discordance with respect to direction of fold change were rare. Finally, we show that additional information is gained from RNA-Seq compared to hybridization-based techniques as RNA-Seq detects more genes than either microarray platform. The majority of genes differentially expressed in RNA-Seq were only detected as present in RNA-Seq, which is important for studies with smaller effect sizes where the sensitivity of hybridization-based techniques could bias interpretation.
Background Heterogeneous stock (HS/NPT) mice have been used to create lines selectively bred in replicate for High Drinking in the Dark (HDID) (Crabbe et al., 2009; Crabbe et al., 2011b). Both selected lines routinely reach a blood ethanol concentration of 1.00 mg/ml or greater at the end of the day 2 four hour period of access. The mechanisms through which genetic differences influence DID are currently unclear. Therefore, the current study examines the transcriptome, the first stage at which genetic variability affects neurobiology. Rather than focusing solely on differential expression, we also examine changes in the ways that gene transcripts collectively interact with each other, as revealed by changes in coexpression patterns. Methods Naïve mice (N=48/group) were genotyped using the Mouse Universal Genotyping Array, which provided 3683 informative markers. QTL analysis used a marker-by-marker strategy with the threshold for a significant LOD set at 10.6. Gene expression in the ventral striatum was measured using the Illumina Mouse 8.2 array. Differential gene expression and the Weighted Gene Coexpression Network Analysis (WGCNA) were implemented largely as described elsewhere (Iancu et al., 2010; Iancu et al., 2012b). Results Significant quantitative trait loci (QTLs) for elevated blood ethanol concentrations after DID were detected on chromosomes 4, 14 and 16; the latter two were associated with gene poor regions. None of the QTLs overlapped with known QTLs for ethanol preference drinking. Ninety- four transcripts were detected as being differentially expressed in both selected lines vs HS controls; there was no overlap with known preference genes. The WGCNA revealed 2 modules as showing significant effects of both selections on intramodular connectivity. A number of genes known to be associated with ethanol phenotypes (e.g. Gabarg1, Glra2, Grik1, Npy2r and Nts) showed significant changes in connectivity. Discussion We found marked and consistent effects of selection on coexpression patterns; differential expression changes were more modest and less concordant. The QTLs and differentially expressed genes detected here are distinct from the preference phenotype (Mulligan et al., 2006). This is consistent with behavioral data and suggests that the DID and preference phenotypes are markedly different genetically.
BackgroundThe current study focused on the extent genetic diversity within a species (Mus musculus) affects gene co-expression network structure. To examine this issue, we have created a new mouse resource, a heterogeneous stock (HS) formed from the same eight inbred strains that have been used to create the collaborative cross (CC). The eight inbred strains capture > 90% of the genetic diversity available within the species. For contrast with the HS-CC, a C57BL/6J (B6) × DBA/2J (D2) F2 intercross and the HS4, derived from crossing the B6, D2, BALB/cJ and LP/J strains, were used. Brain (striatum) gene expression data were obtained using the Illumina Mouse WG 6.1 array, and the data sets were interrogated using a weighted gene co-expression network analysis (WGCNA).ResultsGenes reliably detected as expressed were similar in all three data sets as was the variability of expression. As measured by the WGCNA, the modular structure of the transcriptome networks was also preserved both on the basis of module assignment and from the perspective of the topological overlap maps. Details of the HS-CC gene modules are provided; essentially identical results were obtained for the HS4 and F2 modules. Gene ontology annotation of the modules revealed a significant overrepresentation in some modules for neuronal processes, e.g., central nervous system development. Integration with known protein-protein interactions data indicated significant enrichment among co-expressed genes. We also noted significant overlap with markers of central nervous system cell types (neurons, oligodendrocytes and astrocytes). Using the Allen Brain Atlas, we found evidence of spatial co-localization within the striatum for several modules. Finally, for some modules it was possible to detect an enrichment of transcription binding sites. The binding site for Wt1, which is associated with neurodegeneration, was the most significantly overrepresented.ConclusionsDespite the marked differences in genetic diversity, the transcriptome structure was remarkably similar for the F2, HS4 and HS-CC. These data suggest that it should be possible to integrate network data from simple and complex crosses. A careful examination of the HS-CC transcriptome revealed the expected structure for striatal gene expression. Importantly, we demonstrate the integration of anatomical and network expression data.
Previous studies on changes in murine brain gene expression associated with the selection for ethanol preference have used F2 intercross or heterogeneous stock (HS) founders, derived from standard laboratory strains. However, these populations represent only a small proportion of the genetic variance available in Mus musculus. To investigate a wider range of genetic diversity, we selected mice for ethanol preference using an HS derived from the eight strains of the collaborative cross. These HS mice were selectively bred (four generations) for high and low ethanol preference. The nucleus accumbens shell of naive S4 mice was interrogated using RNA sequencing (RNA-Seq). Gene networks were constructed using the weighted gene coexpression network analysis assessing both coexpression and cosplicing. Selection targeted one of the network coexpression modules (greenyellow) that was significantly enriched in genes associated with receptor signaling activity including Chrna7, Grin2a, Htr2a and Oprd1. Connectivity in the module as measured by changes in the hub nodes was significantly reduced in the low preference line. Of particular interest was the observation that selection had marked effects on a large number of cell adhesion molecules, including cadherins and protocadherins. In addition, the coexpression data showed that selection had marked effects on long non-coding RNA hub nodes. Analysis of the cosplicing network data showed a significant effect of selection on a large cluster of Ras GTPase-binding genes including Cdkl5, Cyfip1, Ndrg1, Sod1 and Stxbp5. These data in part support the earlier observation that preference is linked to Ras/Mapk pathways.
Across species and tissues and especially in the mammalian brain, production of gene isoforms is widespread. While gene expression coordination has been previously described as a scale-free coexpression network, the properties of transcriptome-wide isoform production coordination have been less studied. Here we evaluate the system-level properties of cosplicing in mouse, macaque, and human brain gene expression data using a novel network inference procedure. Genes are represented as vectors/lists of exon counts and distance measures sensitive to exon inclusion rates quantifies differences across samples. For all gene pairs, distance matrices are correlated across samples, resulting in cosplicing or cotranscriptional network matrices. We show that networks including cosplicing information are scale-free and distinct from coexpression. In the networks capturing cosplicing we find a set of novel hubs with unique characteristics distinguishing them from coexpression hubs: heavy representation in neurobiological functional pathways, strong overlap with markers of neurons and neuroglia, long coding lengths, and high number of both exons and annotated transcripts. Further, the cosplicing hubs are enriched in genes associated with autism spectrum disorders. Cosplicing hub homologs across eukaryotes show dramatically increasing intronic lengths but stable coding region lengths. Shared transcription factor binding sites increase coexpression but not cosplicing; the reverse is true for splicing-factor binding sites. Genes with protein-protein interactions have strong coexpression and cosplicing. Additional factors affecting the networks include shared microRNA binding sites, spatial colocalization within the striatum, and sharing a chromosomal folding domain. Cosplicing network patterns remain relatively stable across species.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.