2019
DOI: 10.1016/j.molp.2019.02.008
|View full text |Cite
|
Sign up to set email alerts
|

TIR-Learner, a New Ensemble Method for TIR Transposable Element Annotation, Provides Evidence for Abundant New Transposable Elements in the Maize Genome

Abstract: Transposable elements (TEs) make up a large and rapidly evolving proportion of plant genomes. Among Class II DNA TEs, TIR elements are flanked by characteristic terminal inverted repeat sequences (TIRs). TIR TEs may play important roles in genome evolution, including generating allelic diversity, inducing structural variation, and regulating gene expression. However, TIR TE identification and annotation has been hampered by the lack of effective tools, resulting in erroneous TE annotations and a significant un… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
100
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 121 publications
(100 citation statements)
references
References 76 publications
0
100
0
Order By: Relevance
“…These elements are highly abundant in eukaryotic genomes, and, as such, there are a large number of annotation programs designed to identify them. We tested P-MITE [31], a specialized database of curated plant MITEs, and IRF [50], TIR-Learner [17], and GRF ( grf-main -c 0 ) (https://github.com/bioinfolabmu/GenericRepeatFinder), which structurally identify TIR elements, and finally MITE-Hunter [51], detectMITE [52], MUSTv2 [53], miteFinderII [54], MITE-Tracker [55], and GRF ( grf-mite ), which structurally identify MITEs specifically.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…These elements are highly abundant in eukaryotic genomes, and, as such, there are a large number of annotation programs designed to identify them. We tested P-MITE [31], a specialized database of curated plant MITEs, and IRF [50], TIR-Learner [17], and GRF ( grf-main -c 0 ) (https://github.com/bioinfolabmu/GenericRepeatFinder), which structurally identify TIR elements, and finally MITE-Hunter [51], detectMITE [52], MUSTv2 [53], miteFinderII [54], MITE-Tracker [55], and GRF ( grf-mite ), which structurally identify MITEs specifically.…”
Section: Resultsmentioning
confidence: 99%
“…We found less than half of the novel TIR elements with novel TIRs had more than three copies in the rice genome (Figure 5D). This is because TIR candidates were not filtered based on copy number in TIR-Learner [17], given that some TEs may share similar TIRs but different internal regions (Figure 5D). Still, some of these could be contaminants such as LTR sequences.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…However, this approach can be difficult to implement for TEs and other repetitive sequences. Prior studies of several loci had suggested high levels of variation in TE content among maize haplotypes (Fu and Dooner, ; Yao et al ., ; Brunner et al ., ), and genomic level comparisons using whole‐genome assemblies have been limited to assessing annotated copy number per family without resolution at the level of individual TEs (Springer et al ., ; Su et al ., ). In this study we used an approach to assess the shared and non‐shared nature of individual TEs within collinear homologous blocks of four assembled maize genomes.…”
Section: Discussionmentioning
confidence: 97%
“…There is much literature about applications of machine learning in bioinformatics (for example, reviewed in (Larrañaga et al, 2006)), showing improvements in many aspects such as genome annotation (Arango-López et al, 2017). In recent years, much bioinformatics software has been developed to detect TEs (Girgis, 2015) and, although they follow different strategies (such as homology-based, structure-based, de novo, and using comparative genomics), these lack sensitivity and specificity due to the polymorphic structures of TEs (Su, Gu & Peterson, 2019). Loureiro et al (Loureiro et al, 2013a) proved that ML could be used to improve the accuracy of TEs detection by combining results obtained by several conventional software and training a classifier using these results (Schietgat et al, 2018), (Loureiro et al, 2013b).…”
Section: Benefits Of ML Over Bioinformatics (Q1)mentioning
confidence: 99%