2021
DOI: 10.1038/s41588-020-00756-0
|View full text |Cite|
|
Sign up to set email alerts
|

Efficient phasing and imputation of low-coverage sequencing data using large reference panels

Abstract: Low-coverage whole genome sequencing followed by imputation has been proposed as a cost-effective genotyping approach for disease and population genetics studies. However, its competitiveness against SNP arrays is undermined as current imputation methods are computationally expensive and unable to leverage large reference panels. Here, we describe a method, GLIMPSE, for phasing and imputation of low-coverage sequencing datasets from modern reference panels. We demonstrate its remarkable performance across diff… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

5
326
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 280 publications
(331 citation statements)
references
References 39 publications
5
326
0
Order By: Relevance
“…We considered the 70 key ancestor animals as a reference to call genotypes from low-pass sequencing data (1.11-fold) of genetically similar pigs. In agreement with previous studies in human and cattle populations, the genotyping accuracy from the low-pass sequencing data was very high [6,14,56]. Moreover, the low-pass sequencing-derived genomic relationship coe cients were highly correlated with those obtained using microarray genotyping.…”
Section: Discussionsupporting
confidence: 89%
See 3 more Smart Citations
“…We considered the 70 key ancestor animals as a reference to call genotypes from low-pass sequencing data (1.11-fold) of genetically similar pigs. In agreement with previous studies in human and cattle populations, the genotyping accuracy from the low-pass sequencing data was very high [6,14,56]. Moreover, the low-pass sequencing-derived genomic relationship coe cients were highly correlated with those obtained using microarray genotyping.…”
Section: Discussionsupporting
confidence: 89%
“…To compile the reference haplotypes, we retained 22,618,811 biallelic autosomal SNP that were polymorphic (minor allele count ≥ 1) among the 70 key ancestor pigs. Following the approach proposed by Rubinacci et al [6], we used the mpileup and call commands of BCFtools [67] to calculate genotype likelihoods at the 22,618,811 polymorphic sites in the 175 low-pass sequenced and reference-aligned samples. Subsequently, we applied the phasing and imputation algorithm implemented in GLIMPSE_phase [6] to re ne the BCFtools-derived genotype calls using the previously established haplotype reference panel.…”
Section: Analysis Of Low-pass Sequence Datamentioning
confidence: 99%
See 2 more Smart Citations
“…The availability of sequence variant genotypes from key ancestor animals enables imputing sequence-level genotypes for animals that had been genotyped at lower density [68]. In livestock populations that are routinely genotyped using 60K genotyping arrays, sequence variant genotypes are typically imputed using stepwise imputation [9].…”
Section: Introductionmentioning
confidence: 99%