2019
DOI: 10.1186/s12859-019-2927-x
|View full text |Cite
|
Sign up to set email alerts
|

ImaGene: a convolutional neural network to quantify natural selection from genomic data

Abstract: BackgroundThe genetic bases of many complex phenotypes are still largely unknown, mostly due to the polygenic nature of the traits and the small effect of each associated mutation. An alternative approach to classic association studies to determining such genetic bases is an evolutionary framework. As sites targeted by natural selection are likely to harbor important functionalities for the carrier, the identification of selection signatures in the genome has the potential to unveil the genetic mechanisms unde… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

4
162
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 89 publications
(166 citation statements)
references
References 64 publications
4
162
0
Order By: Relevance
“…However, the discrepancy between ABC and SPIDNA reconstructions in the last 10,000 years might also be due to the sensitivity of ANNs to overfitting and to mispecifications in the model generating training data. For example, decrease in performances due to demographic mispecification has already been shown for selection inference based on ANNs (Torada et al, 2019). In our case, model mispecification arises because cattle breeds are subjected to strong artificial selection pressures, with few males contributing to the next generations, which is a clear violation of the coalescent assumptions underlying our training simulated set.…”
Section: Discussionmentioning
confidence: 93%
See 4 more Smart Citations
“…However, the discrepancy between ABC and SPIDNA reconstructions in the last 10,000 years might also be due to the sensitivity of ANNs to overfitting and to mispecifications in the model generating training data. For example, decrease in performances due to demographic mispecification has already been shown for selection inference based on ANNs (Torada et al, 2019). In our case, model mispecification arises because cattle breeds are subjected to strong artificial selection pressures, with few males contributing to the next generations, which is a clear violation of the coalescent assumptions underlying our training simulated set.…”
Section: Discussionmentioning
confidence: 93%
“…The fourth baseline is a newly design CNN with rectangular shaped filters that cover more than one haplotype (see Methods). This “custom CNN” is a first step towards an architecture tailored to raw genomic data, because spatial information is preserved as for recent ANNs applied to population genetics (Chan et al, 2018, Flagel et al, 2018, Torada et al, 2019), but also because asymmetrical filters of various sizes account for the heterogeneous entities of axes (haplotype versus SNP, rather than pixel versus pixel). Finally we adapted and re-trained four networks among the top-ranked CNNs proposed by Flagel et al (2018) so that they could reconstruct a 21-epoch model of instantaneous effective population size rather than the three-epoch model initially investigated by the authors, and for practicability we called them Flagel CNNs.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations