2019
DOI: 10.1101/696922
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

RepeatFiller newly identifies megabases of aligning repetitive sequences and improves annotations of conserved non-exonic elements

Abstract: Transposons and other repetitive sequences make up a large part of complex genomes. Repetitive sequences can be co-opted into a variety of functions and thus provide a source for evolutionary novelty. However, comprehensively detecting ancestral repeats that align between species is difficult since considering all repeat-overlapping seeds in alignment methods that rely on the seed-and-extend heuristic results in prohibitively high runtimes. Here, we show that ignoring repeat-overlapping alignment seeds when al… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
1

Relationship

3
1

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 35 publications
0
5
0
Order By: Relevance
“…We used axtChain (Kent et al, 2003) (default parameters except -linearGap = loose) to build co-linear alignment chains. The chains were subsequently processed by RepeatFiller (Osipova et al, 2019) (default parameters) to incorporate previously-undetected alignments between repetitive sequences and by chainCleaner (Suarez et al, 2017) (default parameters) to improve alignment specificity. Chains were converted to alignment nets using a modified version of chainNet (Kent et al, 2003) that computes real scores of partial nets.…”
Section: Multiple Genome Alignmentmentioning
confidence: 99%
“…We used axtChain (Kent et al, 2003) (default parameters except -linearGap = loose) to build co-linear alignment chains. The chains were subsequently processed by RepeatFiller (Osipova et al, 2019) (default parameters) to incorporate previously-undetected alignments between repetitive sequences and by chainCleaner (Suarez et al, 2017) (default parameters) to improve alignment specificity. Chains were converted to alignment nets using a modified version of chainNet (Kent et al, 2003) that computes real scores of partial nets.…”
Section: Multiple Genome Alignmentmentioning
confidence: 99%
“…The lav files were then converted into psl format and chain together axt alignment using UCSC Genome Browser tools lavToPsl and axtChain. Next, newly detected repeat-overlapping alignments were incorporated into pairwise alignment chains using RepeatFiller ( Osipova et al 2019 ). Highly sensitive local pairwise alignment for loci flanked by aligning blocks was obtained using patchChain.perl ( Hiller et al 2013 ; Sharma and Hiller 2017 ), and the accuracy of local pairwise alignments was improved using chainCleaner ( Suarez et al 2017 ).…”
Section: Methodsmentioning
confidence: 99%
“…Genomes were aligned using LASTZ 1.04.03 [85] with parameters (K = 2400, L = 3000, Y = 9400, H = 2000 and the lastz default scoring matrix). Then, we used axtChain [30] (default parameters except linearGap=loose) to compute co-linear alignment chains, RepeatFiller [86] (default parameters) to capture previously missed alignments between repetitive regions and chainCleaner [87] (default parameters except minBrokenChainScore=75000 and -doPairs) to improve alignment specificity. H. vespertilio was used as reference and H. euphorbiae , B. mori and M. sexta as queries.…”
Section: Methodsmentioning
confidence: 99%
“…To compare contiguity between H. euphorbiae, H. Genomes were aligned using LASTZ 1.04.03 [85] with parameters (K = 2400, L = 3000, Y = 9400, H = 2000 and the lastz default scoring matrix). Then, we used axtChain [30] (default parameters except linearGap=loose) to compute co-linear alignment chains, RepeatFiller [86] (default parameters) to capture previously missed alignments between repetitive regions and chainCleaner [87] (default parameters mapped to the H. euphorbiae genome using BLAT [30,31]. The alignment together with the M. sexta (XM_037446381) BLAT sequence results from both Hyles genomes was used to produce a small phylogenetic tree using the online RaxML [88] BlackBox portal (https://raxml-ng.vital-it.ch/#/) to illustrate orthology and variability.…”
Section: Comparison To Other Speciesmentioning
confidence: 99%