Anadromous Chinook salmon populations vary in the period of river entry at the initiation of adult freshwater migration, facilitating optimal arrival at natal spawning. Run timing is a polygenic trait that shows evidence of rapid parallel evolution in some lineages, signifying a key role for this phenotype in the ecological divergence between populations. Studying the genetic basis of local adaptation in quantitative traits is often impractical in wild populations. Therefore, we used a novel approach, Random Forest, to detect markers linked to run timing across 14 populations from contrasting environments in the Columbia River and Puget Sound, USA. The approach permits detection of loci of small effect on the phenotype. Divergence between populations at these loci was then examined using both principle component analysis and FST outlier analyses, to determine whether shared genetic changes resulted in similar phenotypes across different lineages. Sequencing of 9107 RAD markers in 414 individuals identified 33 predictor loci explaining 79.2% of trait variance. Discriminant analysis of principal components of the predictors revealed both shared and unique evolutionary pathways in the trait across different lineages, characterized by minor allele frequency changes. However, genome mapping of predictor loci also identified positional overlap with two genomic outlier regions, consistent with selection on loci of large effect. Therefore, the results suggest selective sweeps on few loci and minor changes in loci that were detected by this study. Use of a polygenic framework has provided initial insight into how divergence in a trait has occurred in the wild.
Large genomic studies are becoming increasingly common with advances in sequencing technology, and our ability to understand how genomic variation influences phenotypic variation between individuals has never been greater. The exploration of such relationships first requires the identification of associations between molecular markers and phenotypes. Here, we explore the use of Random Forest (RF), a powerful machine-learning algorithm, in genomic studies to discern loci underlying both discrete and quantitative traits, particularly when studying wild or nonmodel organisms. RF is becoming increasingly used in ecological and population genetics because, unlike traditional methods, it can efficiently analyse thousands of loci simultaneously and account for nonadditive interactions. However, understanding both the power and limitations of Random Forest is important for its proper implementation and the interpretation of results. We therefore provide a practical introduction to the algorithm and its use for identifying associations between molecular markers and phenotypes, discussing such topics as data limitations, algorithm initiation and optimization, as well as interpretation. We also provide short R tutorials as examples, with the aim of providing a guide to the implementation of the algorithm. Topics discussed here are intended to serve as an entry point for molecular ecologists interested in employing Random Forest to identify trait associations in genomic data sets.
The discernment of populations as management units is a fundamental prerequisite for sustainable exploitation of species. A lack of clear stock boundaries complicates not only the identification of spatial management units, but also the assessment of mixed fisheries by population assignment and mixed stock analysis. Many marine species, such as Pacific cod, are characterized by isolation by distance, showing significant differentiation but no clear stock boundaries. Here, we used restriction‐site‐associated DNA (RAD) sequencing to investigate population structure and assess power to genetically assign Pacific cod to putative populations of origin. Samples were collected across the species range in the eastern Pacific Ocean, from the Salish Sea to the Aleutian Islands. A total of 6,425 putative biallelic single nucleotide polymorphisms were identified from 276 individuals. We found a strong isolation‐by‐distance signal along coastlines that mirrored previous microsatellite results and pronounced genetic differentiation between coastal samples and those from the inland waters of the Salish Sea, with no evidence for hybridization between these two populations. Individual assignment success based on two methods was high overall (≥84%) but decreased from south to north. Assignment to geographic location of origin also was successful, with average distance between capture and assignment location of 220 km. Outlier analyses identified more loci potentially under selection along the coast than between Salish Sea and coastal samples, suggesting more diverse adaptation to latitudinal environmental factors than inshore vs. offshore environments. Our results confirm previous observations of sharp genetic differentiation of the Salish Sea population and isolation by distance along the coast, but also highlight the feasibility of using modern genomic techniques to inform stock boundaries and fisheries management in a low F ST marine species.
Poleward species range shifts have been predicted to result from climate change, and many observations have confirmed such movement. Poleward shifts may represent a homogeneous shift in distribution, seasonal northward movement of specific populations, or colonization processes at the poleward edge of the distribution. The ecosystem of the Bering Sea has been changing along with the climate, moving from an arctic to a subarctic system. Several fish species have been observed farther north than previously reported and in increasing abundances. We examined one of these fish species, Pacific cod, in the northern Bering Sea (NBS) to assess whether they migrated from another stock in the eastern Bering Sea (EBS), Gulf of Alaska, or Aleutian Islands, or whether they represent a separate population. Genetic analyses using 3,599 single nucleotide polymorphism markers indicated that nonspawning cod collected in August 2017 in the NBS were similar to spawning stocks of cod in the EBS.This result suggests escalating northward movement of the large EBS stock during summer months. Whether the cod observed in the NBS migrate south during winter to spawn or remain in the NBS as a sink population is unknown.
Understanding and identifying the genetic mechanisms responsible for sex-determination are important for species management, particularly in exploited fishes where sex biased harvest could have implications on population dynamics and long-term persistence. The Pacific halibut (Hippoglossus stenolepis) supports important fisheries in the North Pacific Ocean. The proportion of each sex in the annual harvest is currently estimated using growth curves, but genetic techniques may provide a more accurate method. We used restriction-site associated DNA (RAD) sequencing to identify RAD-tags that were linked to genetic sex, based on differentiation (FST) between the sexes. Identified RAD-tags were aligned to the Atlantic halibut (Hippoglossus hippoglossus) linkage map, the turbot (Scophthalmus maximus) genome, and the half-smooth tongue sole (Cynoglossus semilaevis) genome to identify genomic regions that may be involved in sex determination. In total, 56 RAD-tags (70 single nucleotide polymorphisms) were linked to sex, and 3 RAD-tags were identified in only females. Sex-linked loci aligned to 3 linkage groups in the Atlantic halibut (LG07: 7 loci, LG15: 1 locus, and LG24: 1 locus), 3 chromosomes in the turbot (LG12: 13 loci, LG01: 1 locus, and LG05: 1 locus), and 1 chromosome in the half-smooth tongue sole (ChrZ: 9 loci). Results add support to the hypothesis that Pacific halibut genetic sex is determined in a ZW system. Two sex-linked loci were further developed into sex identification assays, and their efficacy was tested on individuals that had been morphologically sexed. The accuracy of each assay on its own was 97.5% compared to morphological sex.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.