Most species are structured and influenced by processes that either increased or reduced gene flow between populations. However, most population genetic inference methods assume panmixia and reconstruct a history characterized by population size changes. This is potentially problematic as population structure can generate spurious signals of population size change through time. Moreover, when the model assumed for demographic inference is misspecified, genomic data will likely increase the precision of misleading if not meaningless parameters. For instance, if data were generated under an n-island model (characterized by the number of islands and migrants exchanged) inference based on a model of population size change would produce precise estimates of a bottleneck that would be meaningless. In addition, archaeological or climatic events around the bottleneck's timing might provide a reasonable but potentially misleading scenario. In a context of model uncertainty (panmixia versus structure) genomic data may thus not necessarily lead to improved statistical inference. We consider two haploid genomes and develop a theory that explains why any demographic model with structure will necessarily be interpreted as a series of changes in population size by inference methods ignoring structure. We formalize a parameter, the inverse instantaneous coalescence rate, and show that it is equivalent to a population size only in panmictic models, and is mostly misleading for structured models. We argue that this issue affects all population genetics methods ignoring population structure which may thus infer population size changes that never took place. We apply our approach to human genomic data.
Several inferential methods using genomic data have been proposed to quantify and date population size changes in the history of species. At the same time an increasing number of studies have shown that population structure can generate spurious signals of population size change. Recently, Mazet et al. (2016) introduced, for a sample size of two, a time-dependent parameter, which they called the IICR (inverse instantaneous coalescence rate). The IICR is equivalent to a population size in panmictic models, but not necessarily in structured models. It is characterised by a temporal trajectory that suggests population size changes, as a function of the sampling scheme, even when the total population size was constant. Here, we extend the work of Mazet et al. (2016) by (i) showing how the IICR can be computed for any demographic model of interest, under the coalescent, (ii) applying this approach to models of population structure (1D and 2D stepping stone, split models, two- and three-island asymmetric gene flow, continent-island models), (iii) stressing the importance of the sampling strategy in generating different histories, (iv) arguing that IICR plots can be seen as summaries of genomic information that can thus be used for model choice or model exclusion (v) applying this approach to the question of admixture between humans and Neanderthals. Altogether these results are potentially important given that the widely used PSMC (pairwise sequentially Markovian coalescent) method of Li and Durbin (2011) estimates the IICR of the sample, not necessarily the history of the populations.
A significant part of eukaryotic noncoding DNA is viewed as the passive result of mutational processes, such as the proliferation of mobile elements. However, sequences lacking an immediate utility can nonetheless play a major role in the long-term evolvability of a lineage, for instance by promoting genomic rearrangements. They could thus be subject to an indirect selection. Yet, such a long-term effect is difficult to isolate either in vivo or in vitro. Here, by performing in silico experimental evolution, we demonstrate that, under low mutation rates, the indirect selection of variability promotes the accumulation of noncoding sequences: Even in the absence of self-replicating elements and mutational bias, noncoding sequences constituted an important fraction of the evolved genome because the indirectly selected genomes were those that were variable enough to discover beneficial mutations. On the other hand, high mutation rates lead to compact genomes, much like the viral ones, although no selective cost of genome size was applied: The indirectly selected genomes were those that were small enough for the genetic information to be reliably transmitted. Thus, the spontaneous evolution of the amount of noncoding DNA strongly depends on the mutation rate. Our results suggest the existence of an additional pressure on the amount of noncoding DNA, namely the indirect selection of an appropriate trade-off between the fidelity of the transmission of the genetic information and the exploration of the mutational neighborhood. Interestingly, this trade-off resulted robustly in the accumulation of noncoding DNA so that the best individual leaves one offspring without mutation (or only neutral ones) per generation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.