2021
DOI: 10.3389/fbinf.2021.691865
|View full text |Cite
|
Sign up to set email alerts
|

TRAL 2.0: Tandem Repeat Detection With Circular Profile Hidden Markov Models and Evolutionary Aligner

Abstract: The Tandem Repeat Annotation Library (TRAL) focuses on analyzing tandem repeat units in genomic sequences. TRAL can integrate and harmonize tandem repeat annotations from a large number of external tools, and provides a statistical model for evaluating and filtering the detected repeats. TRAL version 2.0 includes new features such as a module for identifying repeats from circular profile hidden Markov models, a new repeat alignment method based on the progressive Poisson Indel Process, an improved installation… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 14 publications
(10 citation statements)
references
References 43 publications
0
10
0
Order By: Relevance
“…To improve the accuracy and power of TR annotation, it has been suggested to apply a statistical framework in combination with a meta-approach that utilizes multiple TR prediction methods (Anisimova et al, 2015). We used Tandem Repeat Annotation Library (TRAL), which allows for evolutionary and statistics analyses to validate our annotated TRs (Schaper et al, 2015; Delucchi et al, 2021). Since MPXV genomes during 2022 outbreak most likely has a single origin (Isidro, et al, 2022), TRs in this study can be defined as related by a common ancestral unit under a Markov substitution model and a standard phylogeny model that reflects the unit duplication history (Schaper et al, 2012).…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…To improve the accuracy and power of TR annotation, it has been suggested to apply a statistical framework in combination with a meta-approach that utilizes multiple TR prediction methods (Anisimova et al, 2015). We used Tandem Repeat Annotation Library (TRAL), which allows for evolutionary and statistics analyses to validate our annotated TRs (Schaper et al, 2015; Delucchi et al, 2021). Since MPXV genomes during 2022 outbreak most likely has a single origin (Isidro, et al, 2022), TRs in this study can be defined as related by a common ancestral unit under a Markov substitution model and a standard phylogeny model that reflects the unit duplication history (Schaper et al, 2012).…”
Section: Resultsmentioning
confidence: 99%
“…We used Tandem Repeat Annotation Library (TRAL 2.0), an open source Python 3 algorithm, to identify TR seed motifs from sequence profile databases via seven de novo TR algorithms (HHrepID, TRED, T-REKS, TRUST, XSTREAM, PHOBOS, TRF) (Schaper et al, 2015; Delucchi et al, 2021). Briefly, circular profile hidden Markov models (cpHMMs) were built from these TR seeds and then utilized to annotate TR regions in MPXV sequences.…”
Section: Methodsmentioning
confidence: 99%
“…Many algorithms have been used to detect tandem repeats (TRs) in biological sequences, leading to inconsistent TR annotations of identical sequences (Schaper et al 2012). To address this issue, we used the TRAL Python library (Delucchi et al 2021) which compares, harmonizes, and makes non-redundant, the output of different repeat detection algorithms (TRF, XSTREAM, PHOBOS, and T-REKS; see Methods). We used TRAL to annotate all SLC6A3 sequences for repeats.…”
Section: Resultsmentioning
confidence: 99%
“…In the present work, we used 64 publicly available long-read haplotype-phased genome assemblies from 32 individuals from the Human Genome Structural Variation Consortium (HGSVC) (Ebert et al 2021) and the Tandem Repeat Annotation Library (TRAL) Python library (Delucchi et al 2021) to characterize the VNTR landscape of SLC6A3 . We report the discovery of a novel hyper-variable number tandem repeat (hyVNTR) in the intron 8 region of SLC6A3 that has a consensus sequence length of 38 bp and repeat copy numbers ranging from 3.4-133.4.…”
Section: Introductionmentioning
confidence: 99%
“…In the present work, we used 64 publicly available long-read haplotype-phased genome assemblies from 32 individuals from the Human Genome Structural Variation Consortium ( Ebert et al, 2021 ) and the Tandem Repeat Annotation Library (TRAL) Python library ( Delucchi et al, 2021 ) to characterize the VNTR landscape of SLC6A3 . We report the discovery of a novel hypervariable number tandem repeat (hyVNTR) in the intron 8 region of SLC6A3 that has a consensus sequence length of 38 bp and repeat copy numbers ranging from 3.4 to 133.4.…”
Section: Introductionmentioning
confidence: 99%