2024
DOI: 10.1111/1755-0998.13947
|View full text |Cite
|
Sign up to set email alerts
|

Estimates of heterozygosity from single nucleotide polymorphism markers are context‐dependent and often wrong

Jarrod Sopniewski,
Renee A. Catullo

Abstract: Genetic diversity is frequently described using heterozygosity, particularly in a conservation context. Often, it is estimated using single nucleotide polymorphisms (SNPs); however, it has been shown that heterozygosity values calculated from SNPs can be biased by both study design and filtering parameters. Though solutions have been proposed to address these issues, our own work has found them to be inadequate in some circumstances. Here, we aimed to improve the reliability and comparability of heterozygosity… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

1
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(3 citation statements)
references
References 64 publications
1
2
0
Order By: Relevance
“…4), suggesting that k-mer-based diversity may scale with population size better by capturing additional neutral genome size variation. Overall, these results suggests that diversity missed by SNPs explains part of Lewontin ’ s paradox in plants, consistent with literature suggesting that SNPs provide an incomplete picture of genome-wide polymorphism [Schmidt et al, 2021, VanWallendael and Alvarez, 2022, Jaegle et al, 2023, Sopniewski and Catullo, 2024].…”
Section: Discussionsupporting
confidence: 89%
See 1 more Smart Citation
“…4), suggesting that k-mer-based diversity may scale with population size better by capturing additional neutral genome size variation. Overall, these results suggests that diversity missed by SNPs explains part of Lewontin ’ s paradox in plants, consistent with literature suggesting that SNPs provide an incomplete picture of genome-wide polymorphism [Schmidt et al, 2021, VanWallendael and Alvarez, 2022, Jaegle et al, 2023, Sopniewski and Catullo, 2024].…”
Section: Discussionsupporting
confidence: 89%
“…Finally, previous meta-analyses of population size and diversity data rely on scraping diversity estimates from previously published studies (Frankham [2012], Buffalo [2021], except see Corbett-Detig et al [2015]). However, many studies report inaccurate SNP calls and diversity estimates due to errors in the handling of missing data [Korunes and Samuk, 2021, Schmidt et al, 2021, Sopniewski and Catullo, 2024 and may filter genotype calls differently, making comparisons across species difficult. Re-analyzing whole genome sequencing data with a common pipeline would make diversity estimates across species more comparable [Buffalo, 2021, Mirchandani et al, 2024.…”
Section: Introductionmentioning
confidence: 99%
“…Unexpected patterns in genetic diversity may be the result of recently recognised biases in SNP-only measures of heterozygosity, that do not account for invariant (Schmidt et al, 2021) or multi-allelic (Sopniewski & Catullo, 2024) sites, which may have affected results in a previous study of genetic diversity in northern Australian lizards (Fenker et al, 2021). To ensure methodological factors did not impact our conclusions, we leveraged two high-quality Gehyra reference genomes for bioinformatic processing, that allowed us to estimate individual heterozygosity using both variant and invariant sites.…”
Section: Bioinformatic Processing and Variant Callingmentioning
confidence: 99%