2021
DOI: 10.1101/2021.06.01.446647
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A multiple genome alignment workflow shows the impact of repeat masking and parameter tuning on alignment of functional regions in plants

Abstract: Alignments of multiple genomes are a cornerstone of comparative genomics, but generating these alignments remains technically challenging and often impractical. We developed the msa_pipeline workflow (https://bitbucket.org/bucklerlab/msa_pipeline) based on the LAST aligner to allow practical and sensitive multiple alignment of diverged plant genomes with minimal user inputs. Our workflow only requires a set of genomes in FASTA format as input. The workflow outputs multiple alignments in MAF format, and include… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 21 publications
0
1
0
Order By: Relevance
“…Based on the observation that long indels are generally derived from TE movement, while different mechanisms introduce shorter indel mutations ( 11 ), the two-piece affine gap cost strategy enables the alignment of short and long indels. The performance of other genome alignment implementations may improve by fine-tuning gap parameters ( 47 ), but, with the exception of minimap2, all other tested tools align long indels and short indels using the same gap penalty profile. This precludes parameter tuning as a solution to simultaneously optimize the alignment of long indels and short indels.…”
Section: Discussionmentioning
confidence: 99%
“…Based on the observation that long indels are generally derived from TE movement, while different mechanisms introduce shorter indel mutations ( 11 ), the two-piece affine gap cost strategy enables the alignment of short and long indels. The performance of other genome alignment implementations may improve by fine-tuning gap parameters ( 47 ), but, with the exception of minimap2, all other tested tools align long indels and short indels using the same gap penalty profile. This precludes parameter tuning as a solution to simultaneously optimize the alignment of long indels and short indels.…”
Section: Discussionmentioning
confidence: 99%