MEGASAT is software that enables genotyping of microsatellite loci using next-generation sequencing data. Microsatellites are amplified in large multiplexes, and then sequenced in pooled amplicons. MEGASAT reads sequence files and automatically scores microsatellite genotypes. It uses fuzzy matches to allow for sequencing errors and applies decision rules to account for amplification artefacts, including nontarget amplification products, replication slippage during PCR (amplification stutter) and differential amplification of alleles. An important feature of MEGASAT is the generation of histograms of the length-frequency distributions of amplification products for each locus and each individual. These histograms, analogous to electropherograms traditionally used to score microsatellite genotypes, enable rapid evaluation and editing of automatically scored genotypes. MEGASAT is written in Perl, runs on Windows, Mac OS X and Linux systems, and includes a simple graphical user interface. We demonstrate MEGASAT using data from guppy, Poecilia reticulata. We genotype 1024 guppies at 43 microsatellites per run on an Illumina MiSeq sequencer. We evaluated the accuracy of automatically called genotypes using two methods, based on pedigree and repeat genotyping data, and obtained estimates of mean genotyping error rates of 0.021 and 0.012. In both estimates, three loci accounted for a disproportionate fraction of genotyping errors; conversely, 26 loci were scored with 0-1 detected error (error rate ≤0.007). Our results show that with appropriate selection of loci, automated genotyping of microsatellite loci can be achieved with very high throughput, low genotyping error and very low genotyping costs.
Individual assignment and genetic mixture analysis are commonly utilized in contemporary wildlife and fisheries management. Although microsatellite loci provide unparalleled numbers of alleles per locus, their use in assignment applications is increasingly limited. However, next‐generation sequencing, in conjunction with novel bioinformatic tools, allows large numbers of microsatellite loci to be simultaneously genotyped, presenting new opportunities for individual assignment and genetic mixture analysis. Here, we scanned the published Atlantic salmon genome to identify 706 microsatellite loci, from which we developed a final panel of 101 microsatellites distributed across the genome (average 3.4 loci per chromosome). Using samples from 35 Atlantic salmon populations (n = 1,485 individuals) from coastal Labrador, Canada, a region characterized by low levels of differentiation in this species, this panel identified 844 alleles (average of 8.4 alleles per locus). Simulation‐based evaluations of assignment and mixture identification accuracy revealed unprecedented resolution, clearly identifying 26 rivers or groups of rivers spanning 500 km of coastline. This baseline was used to examine the stock composition of 696 individuals harvested in the Labrador Atlantic salmon fishery and revealed that coastal fisheries largely targeted regional groups (<300 km). This work suggests that the development and application of large sequenced microsatellite panels presents great potential for stock resolution in Atlantic salmon and more broadly in other exploited anadromous and marine species.
Conservation of exploited species requires an understanding of both genetic diversity and the dominant structuring forces, particularly near range limits, where climatic variation can drive rapid expansions or contractions of geographic range. Here, we examine population structure and landscape associations in Atlantic salmon (Salmo salar) across a heterogeneous landscape near the northern range limit in Labrador, Canada. Analysis of two amplicon-based data sets containing 101 microsatellites and 376 single nucleotide polymorphisms (SNPs) from 35 locations revealed clear differentiation between populations spawning in rivers flowing into a large marine embayment (Lake Melville) compared to coastal populations. The mechanisms influencing the differentiation of embayment populations were investigated using both multivariate and machine-learning landscape genetic approaches. We identified temperature as the strongest correlate with genetic structure, particularly warm temperature extremes and wider annual temperature ranges. The genomic basis of this divergence was further explored using a subset of locations (n = 17) and a 220K SNP array. SNPs associated with spatial structuring and temperature mapped to a diverse set of genes and molecular pathways, including regulation of gene expression, immune response, and cell development and differentiation. The results spanning molecular marker types and both novel and established methods clearly show climate-associated, fine-scale population structure across an environmental gradient in Atlantic salmon near its range limit in North America, highlighting valuable approaches for predicting population responses to climate change and managing species sustainability.
Chromosomal rearrangements (e.g., inversions, fusions, and translocations) have long been associated with environmental variation in wild populations. New genomic tools provide the opportunity to examine the role of these structural variants in shaping adaptive differences within and among wild populations of non‐model organisms. In Atlantic Salmon (Salmo salar), variations in chromosomal rearrangements exist across the species natural range, yet the role and importance of these structural variants in maintaining adaptive differences among wild populations remains poorly understood. We genotyped Atlantic Salmon (n = 1429) from 26 populations within a highly genetically structured region of southern Newfoundland, Canada with a 220K SNP array. Multivariate analysis, across two independent years, consistently identified variation in a structural variant (translocation between chromosomes Ssa01 and Ssa23), previously associated with evidence of trans‐Atlantic secondary contact, as the dominant factor influencing population structure in the region. Redundancy analysis suggested that variation in the Ssa01/Ssa23 chromosomal translocation is strongly correlated with temperature. Our analyses suggest environmentally mediated selection acting on standing genetic variation in genomic architecture introduced through secondary contact may underpin fine‐scale local adaptation in Placentia Bay, Newfoundland, Canada, a large and deep embayment, highlighting the importance of chromosomal structural variation as a driver of contemporary adaptive divergence.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.