2021
DOI: 10.22541/au.163976415.53888836/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SNPfiltR: an R package for interactive and reproducible SNP filtering

Abstract: Here I describe the novel R package SNPfiltR and demonstrate its functionalities as the backbone of a customizable, reproducible SNP filtering pipeline implemented exclusively via the widely adopted R programming language. SNPfiltR extends existing SNP filtering functionalities by automating the visualization of key parameters such as depth, quality, and missing data, then allowing users to set filters based on optimized thresholds, all within a single, cohesive working environment. All SNPfiltR functions requ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 8 publications
(10 citation statements)
references
References 19 publications
0
10
0
Order By: Relevance
“…This has the advantage of taking into account random sampling and genotyping errors that affect loci differently. In fact, this approach is available in VCFtools but not yet in dartR , snpR orSNPfiltR (Hohenlohe et al 2011;Denecek et al 2011;Mijangos et al 2022;Hemstrom & Jones 2022;DeRaad 2022). Nonetheless, we would like to emphasize that this is not a Hardy-Weinberg equilibrium filter (which requires critical thinking to be correctly applied and interpreted; Waples 2015), and should be used only when looking to obtain neutral autosomal loci (cf.…”
Section: Discussionmentioning
confidence: 99%
“…This has the advantage of taking into account random sampling and genotyping errors that affect loci differently. In fact, this approach is available in VCFtools but not yet in dartR , snpR orSNPfiltR (Hohenlohe et al 2011;Denecek et al 2011;Mijangos et al 2022;Hemstrom & Jones 2022;DeRaad 2022). Nonetheless, we would like to emphasize that this is not a Hardy-Weinberg equilibrium filter (which requires critical thinking to be correctly applied and interpreted; Waples 2015), and should be used only when looking to obtain neutral autosomal loci (cf.…”
Section: Discussionmentioning
confidence: 99%
“…For the final alignment, we retained loci that were present in 75% of the samples. For analyses that require unlinked loci, we further filtered our dataset using the R packages SNPfiltR ( DeRaad, 2021 ) and vcfR ( Knaus & Grünwald, 2017 ), retaining only loci that were more than 1000bp away from one another.…”
Section: Methodsmentioning
confidence: 99%
“…GBS products from separate genomic regions that map to such short, artifactual contigs would be expected to yield heterozygote genotype calls. To minimize these artifacts we: (1) filtered out heterozygotes when allele balance fell below 0.333 or above 0.667 using the R- package SNPfiltr v. 1.00 (DeRaad, 2022), (2) restricted our analysis to SNPs that map to scaffolds > 100 kbp in length (these account for 73% of the L. alabamica assembly); (3) filtered out SNP loci that exhibited only heterozygotes when adjacent SNP loci on the same scaffold exhibited segregation ratios that were not significantly different from 1:2:1 by Chi-square tests, on the basis that evidence of strong selection should be shared by adjacent SNP loci; and (4) restricted our analysis to loci where data were available from > 90% of the progeny of a cross since genotyping errors are more likely to have occurred when few progeny per family are genotyped successfully.…”
Section: Methodsmentioning
confidence: 99%