2021
DOI: 10.1093/molbev/msab266
|View full text |Cite
|
Sign up to set email alerts
|

A Probabilistic Model for Indel Evolution: Differentiating Insertions from Deletions

Abstract: Insertions and deletions (indels) are common molecular evolutionary events. However, probabilistic models for indel evolution are under-developed due to their computational complexity. Here we introduce several improvements to indel modeling: (1) While previous models for indel evolution assumed that the rates and length distributions of insertions and deletions are equal, here we propose a richer model that explicitly distinguishes between the two; (2) We introduce numerous summary statistics that allow Appro… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
27
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1
1

Relationship

2
6

Authors

Journals

citations
Cited by 25 publications
(30 citation statements)
references
References 70 publications
2
27
1
Order By: Relevance
“…Therefore, given an empirical dataset (or datasets) one could use the presented method to extract estimations for the posterior distribution of the genome rearrangement parameters. Similar tools can be used to extract other evolutionary parameters, for example SPARTA-ABC ( Loewenthal et al 2021 ) can be used to extract posterior distributions of parameters relevant for small insertions and deletions. These posterior distributions can be used to simulate an entire genome data, which will better reflect empirical datasets compared to data for which the parameters were drawn randomly.…”
Section: Discussionmentioning
confidence: 99%
“…Therefore, given an empirical dataset (or datasets) one could use the presented method to extract estimations for the posterior distribution of the genome rearrangement parameters. Similar tools can be used to extract other evolutionary parameters, for example SPARTA-ABC ( Loewenthal et al 2021 ) can be used to extract posterior distributions of parameters relevant for small insertions and deletions. These posterior distributions can be used to simulate an entire genome data, which will better reflect empirical datasets compared to data for which the parameters were drawn randomly.…”
Section: Discussionmentioning
confidence: 99%
“…In the last 10 years, several models and computational tools to predict different effects of residue coevolution, such as the nonlinearity of effects caused by substitutions and their different combinations, have been created. , An overview of the main computational concepts of these models and an explanation of the data they use to infer information about residue coevolution will be provided later. Furthermore, computational methods have been reported to model the functional effects and evolution of indels and a protein design tool to predict amino acid indels (and substitutions) for protein engineering.…”
Section: Enzyme Engineering Strategiesmentioning
confidence: 99%
“…53,54 An overview of the main computational concepts of these models and an explanation of the data they use to infer information about residue coevolution will be provided later. Furthermore, computational methods have been reported to model the functional effects 55 and evolution 56 of indels and a protein design tool to predict amino acid indels (and substitutions) 57 for protein engineering.…”
Section: Directed Evolutionmentioning
confidence: 99%
“…Thus, we expect that ANNs, which can implicitly construct indel models from training MSA by ANNs, may have the potential to improve branch length estimation relative to traditional methods that ignore indel information. Indel events provide useful information for evolutionary inference for highly diverged sequences (Rokas and Holland 2000;Loewenthal et al 2021) or in scenarios where the number of substitutions is insufficient for reliable estimation (Redelings and Suchard 2007), so indels could have a positive effect on branch length estimation for trees with long or short branches. Acknowledgements…”
Section: Concluding Remarks and Path Forwardmentioning
confidence: 99%