Almost all regulation of gene expression in eukaryotic genomes is mediated by the action of distant non-coding transcriptional enhancers upon proximal gene promoters. Enhancer locations cannot be accurately predicted bioinformatically because there is no defined sequence code, and thus functional assays are required for their direct detection. Here we used a massively parallel reporter assay, Self-Transcribing Active Regulatory Region sequencing (STARR-seq), to generate the first comprehensive genome-wide map of enhancers in Anopheles coluzzii, a major African malaria vector in the Gambiae species complex. The screen was carried out by transfecting reporter libraries created from the genomic DNA of 60 wild A. coluzzii from Burkina Faso into A. coluzzii 4a3A cells, in order to functionally query enhancer activity of the natural population within the homologous cellular context. We report a catalog of 3,288 active genomic enhancers that were significant across three biological replicates, 74% of them located in intergenic and intronic regions. The STARR-seq enhancer screen is chromatin-free and thus detects the inherent activity of a comprehensive catalog of enhancers, including those that may be restricted in vivo to specific cell types or developmental stages. Testing a validation panel of enhancer candidates using manual luciferase assays confirmed enhancer function in 26 of 28 candidates (93%), over a wide dynamic range from 2-fold to at least 16-fold activity above baseline. The enhancers occupy only 0.7% of the genome and display distinct compositional features. Compared to matched non-enhancer genomic controls, the enhancer compartment is significantly enriched for 15 transcription factor binding site signatures and displays divergence for specific dinucleotide repeats. The genome-wide catalog of A. coluzzii enhancers is publicly available in a simple searchable graphic format.
This enhancer catalog will be valuable in linking genetic and phenotypic variation, in identifying regulatory elements that could be employed in vector manipulation, and in better targeting of chromosome editing to minimize extraneous regulatory influences on the introduced sequences.
Importance: Understanding the role of the non-coding regulatory genome in complex disease phenotypes is essential, but even in well-characterized model organisms, identification of regulatory regions within the vast non-coding genome remains a challenge. We used a large-scale assay to generate a genome-wide map of transcriptional enhancers. Such a catalog for the important malaria vector, Anopheles coluzzii, will be an important research tool as the role of non-coding regulatory variation in differential susceptibility to malaria infection is explored, and a public resource for research on this important insect vector of disease.
Background: Anopheles cell lines are used in a variety of ways to better understand the major vectors of malaria in sub-Saharan Africa. Despite this, commonly used cell lines are not well characterized, and no tools are available for cell line identification and authentication.
Methods: Using whole genome sequencing, genomes of the 4a-3A and 4a-3B ‘hemocyte-like’ cell lines were characterized for insertions and deletions (indels) and SNP variation. Genomic locations of distinguishing sequence variation and species origin of the cell lines were also examined. Unique indels were targeted to develop a PCR-based cell line authentication assay. Mitotic chromosomes were examined to survey the cytogenetic landscape for chromosome structure and copy number in the cell lines.
Results: The 4a-3A and 4a-3B cell lines are female in origin and primarily of Anopheles coluzzii ancestry. Cytogenetic analysis indicates that the two cell lines are essentially diploid, with some relatively minor chromosome structural rearrangements. Whole-genome sequence was generated, and analysis indicated that the SNPs and indels that differentiate the cell lines are clustered on chromosome 2R in the regions of the 2Rb, 2Rc and 2Ru chromosomal inversions. A PCR-based authentication assay was developed to fingerprint three indels unique to each cell line. The assay distinguishes between 4a-3A and 4a-3B cells and also uniquely identifies two additional An. coluzzii cell lines tested, Ag55 and Sua4.0. The assay has the specificity to distinguish four cell lines and the sensitivity to detect cellular contamination within a sample of cultured cells.
Conclusions: Genomic characterization of the 4a-3A and 4a-3B Anopheles cell lines was used to develop a simple diagnostic assay that can distinguish these cell lines within and across research laboratories. A cytogenetic survey indicated that the 4a-3A and Sua4.0 cell lines carry essentially normal diploid chromosomes, which makes them amenable to CRISPR/Cas9 genome editing. The authentication assay presented here, coupled with screening for mycoplasma, will allow validation of the integrity of experimental resources and will promote greater experimental reproducibility.