2023
DOI: 10.1101/2023.07.27.550877
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Widespread deviant patterns of heterozygosity in whole-genome sequencing due to autopolyploidy, repeated elements, and duplication

Abstract: Most population genomic tools rely on accurate SNP calling and filtering to meet their underlying assumptions. However, genomic complexity, due to structural variants, paralogous sequences and repetitive elements, presents significant challenges in assembling contiguous reference genomes. Consequently, short-read resequencing studies can encounter mismapping issues, leading to SNPs that deviate from Mendelian expected patterns of heterozygosity and allelic ratio. In this study, we employed the ngsParalog softw… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 83 publications
0
2
0
Order By: Relevance
“…To identify syntenic regions, we followed the steps described by Dallaire et al. (2023): in summary, we aligned the genome to itself with nucmer (built‐in mapper in MUMmer version 4.0.0; Marçais et al., 2018), then performed synteny analysis with SyMAP (Soderlund et al., 2011), and re‐mapped syntenic blocks to the genome with LASTZ (version 1.04.15; Harris, 2007) to get the homology percentage. We identified repeats and transposable elements using RepeatMasker (version 4.0.8; Smit et al., 2013).…”
Section: Methodsmentioning
confidence: 99%
“…To identify syntenic regions, we followed the steps described by Dallaire et al. (2023): in summary, we aligned the genome to itself with nucmer (built‐in mapper in MUMmer version 4.0.0; Marçais et al., 2018), then performed synteny analysis with SyMAP (Soderlund et al., 2011), and re‐mapped syntenic blocks to the genome with LASTZ (version 1.04.15; Harris, 2007) to get the homology percentage. We identified repeats and transposable elements using RepeatMasker (version 4.0.8; Smit et al., 2013).…”
Section: Methodsmentioning
confidence: 99%
“…In an effort to better link the genomic context and the confidence in SV calling, we compared the frequency distributions of both high-quality SVs that passed all filtering steps and low-quality, filtered out SVs in two genomic features known to interfere with SV calling: highly similar regions resulting from whole-genome duplication events (e.g., syntenic regions) and repeated content (repeats and transposable elements). To identify syntenic regions, we followed the steps described by Dallaire et al (2023): in summary, we aligned the genome to itself with nucmer (built-in mapper in MUMmer version 4.0.0; Marçais et al, 2018), then performed synteny analysis with SyMAP (Soderlund, Bomhoff & Nelson, 2011), et re-mapped syntenic blocks to the genome with LASTZ (version 1.04.15, Harris, 2007) to get the homology percentage. We identified repeats and transposable elements using RepeatMasker (version 4.0.8).…”
Section: Methodsmentioning
confidence: 99%