2019
DOI: 10.1101/589069
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Discovery of ongoing selective sweeps withinAnophelesmosquito populations using deep learning

Abstract: Identification of partial sweeps, which include both hard and soft sweeps that have not currently reached fixation, provides crucial information about ongoing evolutionary responses. To this end, we introduce a deep learning approach that uses a convolutional neural network for image processing, which is trained with coalescent simulations incorporating population-specific history, to discover selective sweeps from population genomic data. This approach distinguishes between completed versus partial sweeps, ha… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
11
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(12 citation statements)
references
References 53 publications
1
11
0
Order By: Relevance
“…This could bias our method towards inferring sweep scenarios from the training data of fixed sweeps that maintained the highest levels of diversity, which should be RNM soft sweeps with weak selection. Our results confirm the previous findings of Xue et al (2021) that models trained on fixed sweeps are not robust when applied to partial sweeps.…”
Section: Resultssupporting
confidence: 92%
See 1 more Smart Citation
“…This could bias our method towards inferring sweep scenarios from the training data of fixed sweeps that maintained the highest levels of diversity, which should be RNM soft sweeps with weak selection. Our results confirm the previous findings of Xue et al (2021) that models trained on fixed sweeps are not robust when applied to partial sweeps.…”
Section: Resultssupporting
confidence: 92%
“…Importantly, due to the flexibility provided by simulations, which can explore large regions of parameter space and be designed to represent any particular organism and locus of interest, supervised machine learning could provide a powerful approach for making sweep inferences for a variety of organisms and scenarios. Indeed, several implementations of supervised machine learning for sweep inferences have already been successfully demonstrated in recent years (Flagel et al, 2019; Kern & Schrider, 2018; Lin et al, 2011; Mughal & DeGiorgio, 2019; Pavlidis et al, 2010; Pybus et al, 2015; Ronen et al, 2013; Schrider & Kern, 2016; Sheehan & Song, 2016; Sugden et al, 2018; Torada et al, 2019; Xue et al, 2021). By their nature of learning by example from diverse training data, these methods are naturally capable of learning patterns across individual sweeps with highly stochastic signatures and across a variety of analysis hyperparameters such as window size.…”
Section: Introductionmentioning
confidence: 99%
“…First, multilayer perceptron (MLP) were used to process large sets of summary statistics and to predict jointly selective sweeps and simple demographic changes (Sheehan and Song, 2016), or to disentangle between multiple scenarios of archaic introgression thanks to an additional ABC step (Lorente-Galdos et al, 2019, Mondal et al, 2019). A second type of ANN, convolutional neural networks (CNN), were then applied to summary statistics computed over 5Kb genomic regions in order to predict selective sweeps (Xue et al, 2019). A considerable shift occurred when several studies applied ANN directly on genomic data instead of using summary statistic.…”
Section: Introductionmentioning
confidence: 99%
“…Such studies have been ballyhooed as "sophisticated," "cutting-edge," "robust," and "valuable," and it has been argued that they "make a strong case for the idea that machine learning methods could be useful for addressing diverse questions in molecular evolution" [5]. The method has since been applied to various other organisms [e.g., 6,7,8].…”
Section: Introductionmentioning
confidence: 99%