2020
DOI: 10.1093/bib/bbaa280
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of consensus strategies for haplotype phasing

Abstract: Haplotype phasing is a critical step for many genetic applications but incorrect estimates of phase can negatively impact downstream analyses. One proposed strategy to improve phasing accuracy is to combine multiple independent phasing estimates to overcome the limitations of any individual estimate. However, such a strategy is yet to be thoroughly explored. This study provides a comprehensive evaluation of consensus strategies for haplotype phasing. We explore the performance of different consensus paradigms,… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 18 publications
0
4
0
Order By: Relevance
“…In addition, some PRS tools need a previous imputation step, see the ‘Input Data’ column of the Tables for details. The phasing of the haplotypes, that is the separation between the maternally and paternally inherited copies of each chromosome, can be performed with tools like SHAPEIT 3, Eagle 2 and HAPI-UR [ 21 , 22 ]. Genotype imputation, the estimation of missing genotypes from a genotype reference panel, can be performed with tools like Minimac 4 from the Michigan Imputation Server, Impute 5, Beagle 5.2 or PBWT from Sanger Imputation Server [ 23 ].…”
Section: References Included In This Reviewmentioning
confidence: 99%
“…In addition, some PRS tools need a previous imputation step, see the ‘Input Data’ column of the Tables for details. The phasing of the haplotypes, that is the separation between the maternally and paternally inherited copies of each chromosome, can be performed with tools like SHAPEIT 3, Eagle 2 and HAPI-UR [ 21 , 22 ]. Genotype imputation, the estimation of missing genotypes from a genotype reference panel, can be performed with tools like Minimac 4 from the Michigan Imputation Server, Impute 5, Beagle 5.2 or PBWT from Sanger Imputation Server [ 23 ].…”
Section: References Included In This Reviewmentioning
confidence: 99%
“…Quality control was applied to the genotype datasets using PLINK to keep only SNPs with MAF > 0.01, Hardy-Weinberg equilibrium (HWE) < 10 −6 , the missing rate per individuals < 0.1, and missing rate per SNP < 0.1. Quality controlled genotype data were phased through consHap consensus phase estimator [23] by aggregating 15 applications of the SHAPEIT2 tool [24]. This approach has been reported to reduce the switch error rate significantly for datasets with small sample size (< 2,000).…”
Section: Genotype and Haplotype Preparationmentioning
confidence: 99%
“…An imputation software employing reconstructed haplotypes requires the pre-imputation computational processing step of phasing to be performed [19]. Haplotypes can be reconstructed for both related and unrelated individuals.…”
Section: Introductionmentioning
confidence: 99%
“…The suggested overall sample size for unrelated individuals is over 50; the bigger the sample size is, the longer the haplotypes that can be reconstructed [21]. The recommended strategy for unrelated individuals is to perform phasing with haplotype frequency information from a reference population which has been more densely genotyped or sequenced [19,22,23], such as from the HapMap Project and the 1000 Genomes Project [24][25][26]. For related individuals, estimation is performed by considering both the haplotypes that are shared between family members and the haplotype frequency information of the reference population.…”
Section: Introductionmentioning
confidence: 99%