SNPfiltR: an R package for interactive and reproducible SNP filtering

DeRaad, Devon A.

doi:10.22541/au.163976415.53888836/v1

Cited by 8 publications

(10 citation statements)

References 19 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…This has the advantage of taking into account random sampling and genotyping errors that affect loci differently. In fact, this approach is available in VCFtools but not yet in dartR , snpR orSNPfiltR (Hohenlohe et al 2011;Denecek et al 2011;Mijangos et al 2022;Hemstrom & Jones 2022;DeRaad 2022). Nonetheless, we would like to emphasize that this is not a Hardy-Weinberg equilibrium filter (which requires critical thinking to be correctly applied and interpreted; Waples 2015), and should be used only when looking to obtain neutral autosomal loci (cf.…”

Section: Discussionmentioning

confidence: 99%

Easy-to-use R functions to separate reduced-representation genomic datasets into sex-linked and autosomal loci, and conduct sex-assignment

Robledo‐Ruiz

Austin

Amos

et al. 2022

Preprint

View full text Add to dashboard Cite

Identifying sex-linked markers in genomic datasets is important, because their analyses can reveal sex-specific biology, and their presence in supposedly neutral autosomal datasets can result in incorrect estimates of genetic diversity, population structure and parentage. But detecting sex-linked loci can be challenging, and available scripts neglect some categories of sex-linked variation. Here, we present new R functions to (1) identify and separate sex-linked loci in ZW and XY sex determination systems and (2) infer the genetic sex of individuals based on these loci. Two additional functions are presented, to (3) remove loci with artefactually high heterozygosity, and (4) produce input files for parentage analysis. We test these functions on genomic data for two sexually-monomorphic bird species, including one with a neo-sex chromosome system, by comparing biological inferences made before and after removing sex-linked loci using our function. We found that standard filters, such as low read depth and call rate, failed to remove up to 28.7% of sex-linked loci. This led to (i) overestimation of population FIS by ≤ 9%, and the number of private alleles by ≤ 8%; (ii) wrongly inferring significant sex-differences in heterozygosity, (iii) obscuring genetic population structure, and (iv) inferring ~11% fewer correct parentages. We discuss how failure to remove sex-linked markers can lead to incorrect biological inferences (e.g., sex-biased dispersal and cryptic population structure) and misleading management recommendations. For reduced-representation datasets with at least 15 known-sex individuals of each sex, our functions offer convenient, easy-to-use resources to avoid this, and to sex the remaining individuals.

show abstract

Section: Discussionmentioning

confidence: 99%

Easy-to-use R functions to separate reduced-representation genomic datasets into sex-linked and autosomal loci, and conduct sex-assignment

Robledo‐Ruiz

Austin

Amos

et al. 2022

Preprint

View full text Add to dashboard Cite

show abstract

“…For the final alignment, we retained loci that were present in 75% of the samples. For analyses that require unlinked loci, we further filtered our dataset using the R packages SNPfiltR ( DeRaad, 2021 ) and vcfR ( Knaus & Grünwald, 2017 ), retaining only loci that were more than 1000bp away from one another.…”

Section: Methodsmentioning

confidence: 99%

Drivers of phenotypic divergence in a Mesoamerican highland bird

Robles-Bello

Vázquez-López

Ramírez‐Barrera

et al. 2022

PeerJ

View full text Add to dashboard Cite

Animals derive their coloration from a variety of pigments as well as non-pigmentary structural features. One of the most widespread types of pigments are carotenoids, which are used by all invertebrate taxa and most vertebrate orders to generate red, pink, orange and yellow coloration. Despite their widespread use by diverse animal groups, animals obligately obtain carotenoid pigments from diet. Carotenoid-based coloration is therefore modulated by evolutionary and ecological processes that affect the acquisition and deposition of these pigments into tegumentary structures. The Flame-colored Tanager (Piranga bidentata) is a highland songbird in the cardinal family (Cardinalidae) that is distributed from Mexican sierras through Central America up to western Panama. While female plumage throughout its entire range is predominantly yellow, males exhibit a noticeable split in ventral plumage color, which is bright orange on the West slope and the Tres Marias Islands and blood red in Eastern Mexico and Central America. We used Multiple Regression on Matrices (MRM) to evaluate the relative contributions of geographic distance, climate and genetic distance on color divergence and body differences between geographically disjunct populations. We found that differentiation in carotenoid plumage coloration was mainly explained by rainfall differences between disjunct populations, whereas body size differences was best explained by variation in the annual mean temperature and temperature of coldest quarter. These results indicate that climate is a strong driver of phenotypic divergence in Piranga bidentata.

show abstract

“…GBS products from separate genomic regions that map to such short, artifactual contigs would be expected to yield heterozygote genotype calls. To minimize these artifacts we: (1) filtered out heterozygotes when allele balance fell below 0.333 or above 0.667 using the R- package SNPfiltr v. 1.00 (DeRaad, 2022), (2) restricted our analysis to SNPs that map to scaffolds > 100 kbp in length (these account for 73% of the L. alabamica assembly); (3) filtered out SNP loci that exhibited only heterozygotes when adjacent SNP loci on the same scaffold exhibited segregation ratios that were not significantly different from 1:2:1 by Chi-square tests, on the basis that evidence of strong selection should be shared by adjacent SNP loci; and (4) restricted our analysis to loci where data were available from > 90% of the progeny of a cross since genotyping errors are more likely to have occurred when few progeny per family are genotyped successfully.…”

Section: Methodsmentioning

confidence: 99%

The maintenance of self-incompatibility: overdominance of inbreeding depression and S-linked genetic load

Schoen

Baldwin

2022

Preprint

View full text Add to dashboard Cite

Inbreeding depression plays a fundamental role in evolution. To help detect and characterize viability loci that underlie inbreeding depression, we forced self-pollinated plants from self-incompatible populations of Leavenworthia alabamica to produce families of progeny that were genotyped at hundreds of single nucleotide polymorphism (SNP) loci. Bayesian analysis of segregation data for each SNP was used to explore support for different dominance and selection coefficients at linked viability loci. There was strong support for overdominance (or pseudo-overdomiance) at many viability loci, and some support for recessivity and underdominance. One recessive viability locus mapped to the genomic region of the novel self-incompatibility locus in Leavenworthia alabamica. The results are consistent with earlier findings showing that inbreeding depression is recalcitrant to purging in Leavenworthia alabamica. The results also help account for the maintenance of self-incompatibility in this species and are consistent with expectations from evolutionary genetic theory that recessive, deleterious alleles linked to loci under balancing selection are sheltered from selection.

show abstract

SNPfiltR: an R package for interactive and reproducible SNP filtering

Cited by 8 publications

References 19 publications

Easy-to-use R functions to separate reduced-representation genomic datasets into sex-linked and autosomal loci, and conduct sex-assignment

Easy-to-use R functions to separate reduced-representation genomic datasets into sex-linked and autosomal loci, and conduct sex-assignment

Drivers of phenotypic divergence in a Mesoamerican highland bird

The maintenance of self-incompatibility: overdominance of inbreeding depression and S-linked genetic load

Contact Info

Product

Resources

About