2021
DOI: 10.3390/insects12070591
|View full text |Cite
|
Sign up to set email alerts
|

Annotating the Insect Regulatory Genome

Abstract: An ever-growing number of insect genomes is being sequenced across the evolutionary spectrum. Comprehensive annotation of not only genes but also regulatory regions is critical for reaping the full benefits of this sequencing. Driven by developments in sequencing technologies and in both empirical and computational discovery strategies, the past few decades have witnessed dramatic progress in our ability to identify cis-regulatory modules (CRMs), sequences such as enhancers that play a major role in regulating… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

4
2

Authors

Journals

citations
Cited by 10 publications
(6 citation statements)
references
References 133 publications
(183 reference statements)
0
6
0
Order By: Relevance
“…It is possible that some of these sequences function as silencers, and therefore their activity could not be observed in the reporter transgene assays. However, we note that the SCRMshaw training set we used, consisting of only eight CREs, is significantly smaller than the recommended training set size of 20–30 sequences [ 9 ]. These six pCREs may therefore simply represent false-positive predictions, putting the false-positive rate somewhere around 33%.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…It is possible that some of these sequences function as silencers, and therefore their activity could not be observed in the reporter transgene assays. However, we note that the SCRMshaw training set we used, consisting of only eight CREs, is significantly smaller than the recommended training set size of 20–30 sequences [ 9 ]. These six pCREs may therefore simply represent false-positive predictions, putting the false-positive rate somewhere around 33%.…”
Section: Resultsmentioning
confidence: 99%
“…Unfortunately, though, for many CREs few if any of their interacting transcription factors are known, nullifying the utility of motif-based approaches. As an alternative, motif-blind approaches have been developed that rely solely on the sequences of a set of known similarly-functioning CREs for predicting new CREs [ 8 , 9 ]. Such in silico approaches offer an opportunity to better resolve the CRE breadth of partially-characterized GRNs, and create a wealth of opportunities to use in vivo methods to reveal the target genes for these new CREs, and to study their evolutionary histories.…”
Section: Introductionmentioning
confidence: 99%
“…Su et al [ 52 ] used REDfly data to assess CRM-discovery approaches, which would have been impossible without REDfly. Computational CRM-discovery methods using REDfly for training data also can identify CRMs in diverse insect species [ 53 ] and, as such, provide a powerful tool for annotating insect regulatory genomes [ 54 ].…”
Section: Utility Of Redflymentioning
confidence: 99%
“…We also expect there to be considerable growth in several data categories. These include CRM_segments , as researchers increasingly turn to CRISPR/Cas9-mediated deletion of regulatory sequences, and pCRMs from multiple new species as experimental methods, such as single-cell ATAC-seq [ 73 ] and computational CRM discovery methods, such as SCRMshaw (reviewed by [ 54 ]) continue to be applied to sequenced insect species at a rapid rate.…”
Section: Use Casesmentioning
confidence: 99%
“…Promisingly, recent technical advances have made it easier to work with traditionally non-model species and their more highly diverged relatives. ATAC-seq has emerged as an affordable method for CRM detection starting from small cell populations [71,72], and computational CRM discovery approaches such as SCRMshaw have shown effectiveness in predicting putatively homologous CRMs in a cross-species manner [73][74][75]. As more species are sequenced, this increases the ability to find relevant CRMs for the GRNs being studied, although the quality of genome assembly and annotation and the need for robust gene orthology mapping remain important limitations.…”
Section: Challenges For the Futurementioning
confidence: 99%