2011
DOI: 10.1186/1471-2105-12-263
|View full text |Cite
|
Sign up to set email alerts
|

Characterization of a Bayesian genetic clustering algorithm based on a Dirichlet process prior and comparison among Bayesian clustering methods

Abstract: BackgroundA Bayesian approach based on a Dirichlet process (DP) prior is useful for inferring genetic population structures because it can infer the number of populations and the assignment of individuals simultaneously. However, the properties of the DP prior method are not well understood, and therefore, the use of this method is relatively uncommon. We characterized the DP prior method to increase its practical use.ResultsFirst, we evaluated the usefulness of the sequentially-allocated merge-split (SAMS) sa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

1
26
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 26 publications
(27 citation statements)
references
References 46 publications
1
26
0
Order By: Relevance
“…We opted to use single reference populations from 1000 Genomes without high levels of admixture and with low intra-population variability [26], consisting of AFR (ESN), EUR (GBR) and EAS (JPT) groups plus sets of two CEPH OCE populations and five CEPH AMR populations (Table 1, population no. 1-6).…”
Section: Dna Samples and Population Datamentioning
confidence: 99%
See 1 more Smart Citation
“…We opted to use single reference populations from 1000 Genomes without high levels of admixture and with low intra-population variability [26], consisting of AFR (ESN), EUR (GBR) and EAS (JPT) groups plus sets of two CEPH OCE populations and five CEPH AMR populations (Table 1, population no. 1-6).…”
Section: Dna Samples and Population Datamentioning
confidence: 99%
“…4A, orange cells) from population analyses described in section 3.6. This strategy also compensated for the large contrasts in sample size of 1000 Genomes data for the first three groups alongside the much smaller sample sizes of Oceanian and Native American populations, which can interfere with STRUCTURE analyses [26].…”
Section: Dna Samples and Population Datamentioning
confidence: 99%
“…It results from the fact that if the MCMC chain is mixing properly, all the equivalent indexing assignment vectors will be sampled equally. To summarize the posterior sample in a way that accounts for the uncertainty of assignment of each locus, as well as for label switching, we considered three methods: (1) pairwise coassignment probabilities (e.g., Dawson and Belkhir 2001;Onogi et al 2011), (2) mean assignment based on partition distances (Huelsenbeck and Andolfatto 2007), and (3) marginal probabilities after relabeling [e.g., pivotal reordering (Lee et al 2009)]. With the first approach it is possible to identify, using Bayes factors, those pairs of loci for which there is strong evidence that both loci belong to the same group.…”
Section: þþmentioning
confidence: 99%
“…Mean assignment: Although it would not be meaningful to take an arithmetic average of sampled assignments, we can use the idea of "mean assignment", a, which is the vector that minimizes the squared partition distance to all of the sampled vectors (Huelsenbeck and Andolfatto 2007;Choi and Hey 2011;Onogi et al 2011). The partition distance is defined as the minimum number of elements that must be removed to make two assignment vectors identical, over all possible labeling of one of the vectors (Gusfield 2002).…”
Section: þþmentioning
confidence: 99%
See 1 more Smart Citation