2006
DOI: 10.1186/1471-2148-6-29
|View full text |Cite
|
Sign up to set email alerts
|

Assessment of methods for amino acid matrix selection and their use on empirical data shows that ad hoc assumptions for choice of matrix are not justified

Abstract: Background: In recent years, model based approaches such as maximum likelihood have become the methods of choice for constructing phylogenies. A number of authors have shown the importance of using adequate substitution models in order to produce accurate phylogenies. In the past, many empirical models of amino acid substitution have been derived using a variety of different methods and protein datasets. These matrices are normally used as surrogates, rather than deriving the maximum likelihood model from the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
432
0
2

Year Published

2007
2007
2024
2024

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 969 publications
(435 citation statements)
references
References 57 publications
1
432
0
2
Order By: Relevance
“…These analyses demonstrated no candidate chimaera sequences. The final masked alignment was analysed using MODELGEN-ERATOR (Keane et al 2004) to identify the most appropriate model for phylogenetic analyses (GTRCG (four discrete categories, aZ0.57)Cproportion of invariant sites (0.49)). Then, the phylogeny was calculated using two methods: (i) PHYML (Guindon et al 2005) using the model parameters estimated using MODELGENERATOR and 100 bootstrap replicates, and (ii) MRBAYES3 (Ronquist & Huelsenbeck 2003) analysis was conducted for 1 000 000 generation samples using model of site rate variation as mentioned above but allowing the MCMCMC to search alternative model parameter values.…”
Section: Methodsmentioning
confidence: 99%
“…These analyses demonstrated no candidate chimaera sequences. The final masked alignment was analysed using MODELGEN-ERATOR (Keane et al 2004) to identify the most appropriate model for phylogenetic analyses (GTRCG (four discrete categories, aZ0.57)Cproportion of invariant sites (0.49)). Then, the phylogeny was calculated using two methods: (i) PHYML (Guindon et al 2005) using the model parameters estimated using MODELGENERATOR and 100 bootstrap replicates, and (ii) MRBAYES3 (Ronquist & Huelsenbeck 2003) analysis was conducted for 1 000 000 generation samples using model of site rate variation as mentioned above but allowing the MCMCMC to search alternative model parameter values.…”
Section: Methodsmentioning
confidence: 99%
“…However, if evolutionary history of the evolving objects under study has contained introgressive events, then a simple diverging tree cannot be used. To use a tree when the data were not generated by a tree-like process is an error in model selection [92]. There are many network-like alternatives to trees, including phylogenetic networks [93], sequence-sharing networks [32] and N-rooted fusion networks [31,33].…”
Section: Analysis Of Path Lengths On Four Different Hypothesesmentioning
confidence: 99%
“…For PHYML analyses, we merged these partitions into one alignment, as PHYML does not handle multiple partitions. The best substitution model for each partition/merged dataset (with and without outgroups) was determined with MODELGENERATOR v. 0.85 [20] on the basis of its Bayesian information criterion, and applied to the corresponding partition/dataset in BEAST (unlinked substitution models) and PHYML analyses (electronic supplementary material, tables S6-S7). BEAST analyses were performed assuming a birth-death serially sampled tree model, and divergence dates were estimated using a log-uncorrelated molecular clock model together with tip-sampling (electronic supplementary material, table S8) and assuming a time for the MRCA at 4.0-4.5 Ma for Equus [2].…”
Section: (D) Phylogenetic Analysesmentioning
confidence: 99%