2015
DOI: 10.1101/021261
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Inference under a Wright-Fisher model using an accurate beta approximation

Abstract: The large amount and high quality of genomic data available today enables, in principle, accurate inference of evolutionary history of observed populations. The Wright-Fisher model is one of the most widely used models for this purpose. It describes the stochastic behavior in time of allele frequencies and the influence of evolutionary pressures, such as mutation and selection. Despite its simple mathematical formulation, exact results for the distribution of allele frequency (DAF) as a function of time are no… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
21
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(22 citation statements)
references
References 37 publications
1
21
0
Order By: Relevance
“…It is built upon combining a Hidden Markov Modeling approach, first proposed by Bollback et al (2008) in the context of genetic time series and parametric approximations to the Wright Fisher process. Within this framework we have shown that using the Beta-with-spikes distribution proposed by Tataru et al (2015) to approximate the Wright Fisher transitions provided a statistical inference comparable to that of the Wright-Fisher model while having a computational cost that does not depend on population size. We have shown this framework allows to perform hypothesis testing by relying on the asymptotic distribution of the Likelihood Ratio Statistic and that the maximum likelihood estimate of the selection intensity parameter had generally good statistical properties.…”
Section: Discussionmentioning
confidence: 97%
See 2 more Smart Citations
“…It is built upon combining a Hidden Markov Modeling approach, first proposed by Bollback et al (2008) in the context of genetic time series and parametric approximations to the Wright Fisher process. Within this framework we have shown that using the Beta-with-spikes distribution proposed by Tataru et al (2015) to approximate the Wright Fisher transitions provided a statistical inference comparable to that of the Wright-Fisher model while having a computational cost that does not depend on population size. We have shown this framework allows to perform hypothesis testing by relying on the asymptotic distribution of the Likelihood Ratio Statistic and that the maximum likelihood estimate of the selection intensity parameter had generally good statistical properties.…”
Section: Discussionmentioning
confidence: 97%
“…Using this class of distribution in a multiple populations context therefore requires developing a specific factorization for calculating the multivariate likelihood. An example of such an extension for the problem of estimating branch length in population trees can be found in Tataru et al (2015). For such data, the Gaussian model remains much more practical but still suffers from the limitations mentioned above.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Our inference procedure could also be used in the typical population phylogenetic setting to infer the divergence history of a group of populations, but this application is limited by the relatively small number of loci (< O(1000)) that our method can accept due to the computational costs of likelihood evaluations with the pruning algorithm. A maximum-likelihood implementation of our model, requiring fewer likelihood evaluations, may be applicable to genome-scale SNP data, possibly comparing to Kim Tree (Gautier and Vitalis, 2013) and SpikeyTree (Tataru et al, 2015).…”
Section: Discussionmentioning
confidence: 99%
“…The pruning algorithm also requires a distribution of allele frequencies at the root of the phylogeny, which, in our application (see below), represents the unobservable distribution of heteroplasmy allele frequencies in the mother as an embryo. Following Tataru et al (2015), we use a discretized, symmetric beta distribution with additional, symmetric probability weights at frequencies 0 and 1. The two parameters specifying this distribution are inferred jointly with genetic drift and mutation parameters.…”
Section: Likelihood Calculationmentioning
confidence: 99%