2012
DOI: 10.1093/sysbio/sys076
|View full text |Cite
|
Sign up to set email alerts
|

TreeFix: Statistically Informed Gene Tree Error Correction Using Species Trees

Abstract: Accurate gene tree reconstruction is a fundamental problem in phylogenetics, with many important applications. However, sequence data alone often lack enough information to confidently support one gene tree topology over many competing alternatives. Here, we present a novel framework for combining sequence data and species tree information, and we describe an implementation of this framework in TreeFix, a new phylogenetic program for improving gene tree reconstructions. Given a gene tree (preferably computed u… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
113
0
1

Year Published

2013
2013
2019
2019

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 99 publications
(114 citation statements)
references
References 40 publications
0
113
0
1
Order By: Relevance
“…To evaluate different tree reconstruction methods, simulated gene sequences were generated using the R package 'Phylosim' 92 . We used publicly available simulated data sets for 16 fungi species 93,94 . These data sets contain 1,000 simulated tree topologies of gene families, each of which was generated under observed gene duplication…”
Section: Go Enrichment Analysis Supplementarymentioning
confidence: 99%
See 1 more Smart Citation
“…To evaluate different tree reconstruction methods, simulated gene sequences were generated using the R package 'Phylosim' 92 . We used publicly available simulated data sets for 16 fungi species 93,94 . These data sets contain 1,000 simulated tree topologies of gene families, each of which was generated under observed gene duplication…”
Section: Go Enrichment Analysis Supplementarymentioning
confidence: 99%
“…5), they were considered to represent the same gene, whereafter we retained our own sequences to circumvent incorrect inference of gene duplication events in phylogeny reconciliation. A maximum-likelihood tree was reconstructed using third codon position sequences of the trimAl-processed alignments, and it was subsequently reconciled with a species tree prepared from a dated large-scale plastid phylogeny of flowering plants 54 using Treefix 94 with default parameters, except with the number of iterations increased to 1,000. Although the plastid-based topology 54 is partly different from nuclear-based topology 52,53 ( Supplementary Fig.…”
Section: Nature Ecology and Evolutionmentioning
confidence: 99%
“…To both increase the computational efficiency of Treefix (see below), and improve the accuracy of our rate estimation, we used a custom Perl script (treesplit.pl on Github) to split trees where the deepest node was inferred to be a duplication rather than a speciation event. After the first round of tree splitting, we used the programs Treefix (v. 1.1.8; default options except -m PROTGAMMAWAG, -niter ¼ 1,000, and -maxtime) and treeannotate (part of the treefix package) to reconcile the species tree with each gene tree and compute the likely number of gains and losses on the tree (Wu et al 2012). Treefix attempts to produce the most parsimonious tree with respect to duplications and losses while remaining consistent with the maximum likelihood gene tree.…”
Section: Determining Orthologs and Paralogs Of M Domestica Proteins mentioning
confidence: 99%
“…As reconciled gene trees also yield orthology relationships, it is possible to compare the results from both independent methods. This is what is often done to assess the quality of gene trees [73,74]. Orthologs obtained from synteny are assumed to represent the truth, and can be compared with orthologs obtained from reconciliations.…”
Section: Synteny Informs Gene Family Evolutionmentioning
confidence: 99%