2016
DOI: 10.1093/sysbio/syw074
|View full text |Cite
|
Sign up to set email alerts
|

Phylogenetic Tree Estimation With and Without Alignment: New Distance Methods and Benchmarking

Abstract: Abstract.-Phylogenetic tree inference is a critical component of many systematic and evolutionary studies. The majority of these studies are based on the two-step process of multiple sequence alignment followed by tree inference, despite persistent evidence that the alignment step can lead to biased results. Here we present a two-part study that first presents PaHMMTree, a novel neighbor joining-based method that estimates pairwise distances without assuming a single alignment. We then use simulations to bench… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
25
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
7
1
1

Relationship

2
7

Authors

Journals

citations
Cited by 24 publications
(25 citation statements)
references
References 56 publications
0
25
0
Order By: Relevance
“…In order to compute PPs, our Divvier software integrates a modified version of Zorro, which in turn uses the model initially proposed by MAVID (Bray and Pachter 2004). This model is trained on data that is much more divergent than that usually used in phylogenomic analyses and the fit of that model, along with the resultant PPs, might be improved by using data dependent models such as those implemented in (e.g.,) PaHMM-Tree (Bogusz and Whelan 2017). An alternative to Zorro might also enable divvier to work on nucleotide sequences, although the inherent difficulty of aligning four character sequences means that clusters of high confidence characters might be harder to identify (Yang 2014).…”
Section: Discussionmentioning
confidence: 99%
“…In order to compute PPs, our Divvier software integrates a modified version of Zorro, which in turn uses the model initially proposed by MAVID (Bray and Pachter 2004). This model is trained on data that is much more divergent than that usually used in phylogenomic analyses and the fit of that model, along with the resultant PPs, might be improved by using data dependent models such as those implemented in (e.g.,) PaHMM-Tree (Bogusz and Whelan 2017). An alternative to Zorro might also enable divvier to work on nucleotide sequences, although the inherent difficulty of aligning four character sequences means that clusters of high confidence characters might be harder to identify (Yang 2014).…”
Section: Discussionmentioning
confidence: 99%
“…Besides, for very large families, such as the Gr , the bootstrap analysis was unfeasible in the practice due to excessive computation times. Given that PaHMM-Tree is an alignment-free approach, which allow us to utilize all the amino acid positions to reconstruct the trees, and that the results obtained by Bogusz and Whelan (2017) point to a better performance of this approach for highly divergent sequences without the need for a previous filtering step, here, we decided to report the results based on this method. However, a more exhaustive study comparing these and other tree reconstruction methods, using both real and simulated data and under different degrees of divergence, would be necessary to know whether this method actually improves the phylogenetic analysis.…”
Section: Discussionmentioning
confidence: 99%
“…2006 ), building a reliable MSA to estimate the phylogenetic relationships is not straightforward. To address this long-standing problem, we applied the MSA-free HMM distance-based method ( Bogusz and Whelan 2017 ) implemented in the PaHMM-Tree software, which outperforms MSA-based methods when dealing with the high alignment uncertainty that is usually associated with large divergences. All the phylogenies except those of the IR family (see Results for more details about this family) were based on complete sequences.…”
Section: Methodsmentioning
confidence: 99%
“…The phylogenetic tree was constructed using Neighbour Joining (NJ) method 14 with tree evaluation step for 1000 bootstrap replicates. The resulted rooted tree ( sequence within clade-I with branch frequency of 100% which pointed out regarding their significant evolutionary closeness.…”
Section: Molecular Phylogeny Ascertained Sequential Divergence Of Sarmentioning
confidence: 99%