2021
DOI: 10.1101/2021.06.27.450081
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Neural ADMIXTURE: rapid population clustering with autoencoders

Abstract: Characterizing the genetic substructure of large cohorts has become increasingly important as genetic association and prediction studies are extended to massive, increasingly diverse, biobanks. ADMIXTURE and STRUCTURE are widely used unsupervised clustering algorithms for characterizing such ancestral genetic structure. These methods decompose individual genomes into fractional cluster assignments with each cluster representing a vector of DNA marker frequencies. The assignments, and clusters, provide an inter… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
2

Relationship

3
3

Authors

Journals

citations
Cited by 10 publications
(15 citation statements)
references
References 34 publications
(49 reference statements)
0
15
0
Order By: Relevance
“…We use a subset of the dataset presented in [29]. This dataset includes human labeled genomic sequences from several publicly available databases.…”
Section: Resultsmentioning
confidence: 99%
“…We use a subset of the dataset presented in [29]. This dataset includes human labeled genomic sequences from several publicly available databases.…”
Section: Resultsmentioning
confidence: 99%
“…For the correction of PRS models, we make use of estimates of global ancestry. For this purpose, we use Neural ADMIXTURE [56], a faster adaptation of the ADMIXTURE algorithm [57] with similar (or better) clustering results. Utilizing the Python implementation of Neural ADMIXTURE, we use data from the 1000 Genomes Project Consortium [58] for training a model in the supervised mode of Neural ADMIXTURE with the default parameters.…”
Section: Genetic Ancestrymentioning
confidence: 99%
“…For the correction of PRS models, we make use of estimates of global ancestry. For this purpose, we use Neural ADMIXTURE [56], a faster adaptation of the ADMIXTURE algorithm [57]…”
Section: Genetic Ancestrymentioning
confidence: 99%
“…Direct computation on compressed genotypes, including all GPU kernels, is also implemented in OpenADMIXTURE. Thus, OpenADMIX-TURE can analyze 16-32 times more data per GPU than GPU-based admixture software such as Mantes et al 38 , whose GPU kernels require a single precision or double precision genotype matrix as input.…”
Section: Direct Computation On the Plink Bed Filementioning
confidence: 99%