2020
DOI: 10.1101/2020.07.11.192575
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Constrained non-coding sequence provides insights into regulatory elements and loss of gene expression in maize

Abstract: DNA sequencing technology has advanced so quickly, identifying key functional regions using evolutionary approaches is required to understand how those genomes work. This research develops a sensitive sequence alignment approach to identify functional constrained non-coding sequences in the Andropogoneae tribe. The grass tribe Andropogoneae contains several crop species descended from a common ancestor ~18 million years ago. Despite broadly similar phenotypes, they have tremendous genomic diversity with a broa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3
3
1

Relationship

5
2

Authors

Journals

citations
Cited by 10 publications
(10 citation statements)
references
References 72 publications
0
10
0
Order By: Relevance
“…The phylogeny was visualised and re-rooted on the longest internal branch in iTOL(142).Population Genetic AnalysisGERP. Soft masked copies of 13 angiosperm genomes were aligned to the unmasked B73v5 reference genome using LAST(143)(144)(145)(146)(147)(148). Repetitive elements in B73v5 were then masked in the aligned sequences.…”
mentioning
confidence: 99%
“…The phylogeny was visualised and re-rooted on the longest internal branch in iTOL(142).Population Genetic AnalysisGERP. Soft masked copies of 13 angiosperm genomes were aligned to the unmasked B73v5 reference genome using LAST(143)(144)(145)(146)(147)(148). Repetitive elements in B73v5 were then masked in the aligned sequences.…”
mentioning
confidence: 99%
“…Studies identifying CPuORFs have also been largely focused on dicots. However, many of the uORFs containing derepressive and repressive variants that we identified are conserved between members of the Andropogoneae tribe 41 , although they are not significantly more or less conserved than all other uORFs (Supplemental Figure 3).…”
Section: Resultsmentioning
confidence: 93%
“…Alignments between maize and five other grasses from the Andropogoneae tribe (45) were filtered to regions that overlap the annotated B73 5' UTRs. uORFs, identified above, were classified based on whether at least one other species had alignments covering 90% or more of the uORF.…”
Section: Conservation Among Andropogoneaementioning
confidence: 99%
“…A drawback is that repeat elements not similar to those in the library will not be masked and, conversely, non-repetitive functional elements with similarities to repeats may be erroneously masked (Bayer, Edwards, and Batley 2018). To compare kmer-based masking approaches to RepeatMasker, we therefore also conducted masking with RED and a novel kmer-based approach (Song et al, 2020) that we refer to as KMER.…”
Section: Methodsmentioning
confidence: 99%
“…Appropriate repeat masking can improve multiple alignment performance A major obstacle to accurate and efficient alignment is the large proportion of repetitive sequence found in most plant genomes. In contrast to masking tools like RepeatMasker that rely on repeat databases, approaches such as RED (Girgis 2015) or KMER (Song et al 2020) try to avoid database bias by using repetitive k-mers (nucleotide sequences of k length) in the genome to identify repeats. Here, we compared RepeatMasker, RED and KMER and tested their impact on subsequent multiple sequence alignment in grasses.…”
Section: Selecting Alignment Benchmarking Metricsmentioning
confidence: 99%