Abstract!Phylogenetic comparative methods (PCMs), especially ones based on linear models, have played a central role in understanding species' trait evolution. These methods, however, usually assume that phylogenetic trees are known without error or uncertainty, but this assumption is most likely incorrect. So far, Markov chain Monte Carlo, MCMC-based Bayesian methods have successfully been deployed to account for such phylogenetic uncertainty in PCMs. Yet, the use of these methods seems to have been limited, probably due to difficulties in their implementation. Here, we propose an approach with which phylogenetic uncertainty is incorporated in a simple, readily implementable and reliable manner. Our approach uses Rubin's rules, which are an integral part of a standard multiple imputation procedure, often employed to recover missing data. In our case, we see the true phylogenetic tree as a missing piece of data, and apply Rubin's rules to amalgamate parameter estimates from a number of models using a set of phylogenetic trees (e.g. a Bayesian posterior distribution of phylogenetic trees). Using a simulation study, we demonstrate that our approach using Rubin's rules performs better in accounting for phylogenetic uncertainty than alternative methods such as MCMC-based Bayesian and Akaike information criterion, AIC-based model averaging approaches; that is, on average, our approach has the best 95% confidence/credible interval coverage among all. A unique property of the multiple imputation procedure is that the index, named 'relative efficiency', could be used to quantify the number of trees required for incorporating phylogenetic uncertainty. Thus, by using the relative efficiency, we show the required tree number is surprisingly small (~50 trees) at least in our simulation. In addition to these advantages above, our approach could be combined seamlessly with PCMs that utilize multiple imputation to recover missing data. Given the ubiquity of missing data, it is likely that the use of the multiple imputation procedure with Rubin's rules will be popular to deal with phylogenetic uncertainty as well as missing data in comparative data. (Lynch 1991;Hadfield and Nakagawa 2010). When one phylogenetic tree is used in analysis, all these methods assume that the phylogeny of organisms is known without error.
KeywordsHowever, no phylogenetic trees (or hypotheses) are known without error. Errors come in the form of uncertainty in branch length, topology, and also in the model of assumed character evolution.Researchers have been investigating the impact of these types of uncertainty on statistical inference (e.g., Díaz-Uriarte & Garland 1996;Symonds 2002). These studies generally suggest the importance of incorporating 'phylogenetic uncertainty' in PCMs; note that by using one tree, point estimates (e.g. regression coefficients) are not necessarily biased (Stone 2011), but uncertainty estimates (e.g. standard error or confidence intervals) are not accurate. Therefore, a number of methods have been proposed to include phyl...