The mammalian genome is depleted in CG dinucleotides, except at protected regions where they cluster as CpG islands (CGIs). CGIs are gene regulatory hubs and serve as transcription initiation sites and are as expected, associated with gene promoters. Advances in genomic annotations demonstrate that a quarter of CGIs are found within genes. Such intragenic regions are repressive environments, so it is surprising that CGIs reside here and even more surprising that some resist repression and are transcriptionally active within a gene. Hence, intragenic CGI positioning within genes is not arbitrary and is instead, selected for. As a wealth of recent studies demonstrate, intragenic CGIs are embedded within genes and consequently, influence ‘host’ gene mRNA isoform length and expand transcriptome diversity.
A large proportion of genes in mammalian genomes are embedded within another gene, sharing nucleotide sequence. Characterisation of these host/nested gene pairs is required to understand their transcriptional crosstalk. We identify all host/nested gene pairs and reveal they are found at over a sixth of genes in both human and mouse. Host genes were more highly expressed and more likely to be protein-coding and their nested gene partners mainly reside within the largest intron of their host. Individual analysis of host or nested gene expression did not reveal tissue specific profiles. Co-expression profiles did reveal tissue specific profiles, suggesting host/nested gene crosstalk plays a role during differentiation and development. To assess true co-expression or mutual expression testis scRNA-seq data were used, revealing that pairs can switch expression dynamically between these states during spermatogenesis. Host genes have a larger pool of isoforms and when co-expression occurred at some testis-specific pairs, host transcript diversity increased. Scanning experimentally validated polyadenylation sites upstream of the nested gene, shows that host polyadenylation sites are enriched, implying that alternative polyadenylation is one of the mechanisms influenced by host/nested gene co-expression. The collection of host/nested genes and our analysis is available on an Rshiny application (https://hngeneviewer.sites.er.kcl.ac.uk/hn_viewer/).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.