2017
DOI: 10.1111/mec.14205
|View full text |Cite
|
Sign up to set email alerts
|

Investigating the genomic basis of discrete phenotypes using a Pool‐Seq‐only approach: New insights into the genetics underlying colour variation in diverse taxa

Abstract: While large-scale genomic approaches are increasingly revealing the genetic basis of polymorphic phenotypes such as colour morphs, such approaches are almost exclusively conducted in species with high-quality genomes and annotations. Here, we use Pool-Seq data for both genome assembly and SNP frequency estimation, followed by scanning for F outliers to identify divergent genomic regions. Using paired-end, short-read sequencing data from two groups of individuals expressing divergent phenotypes, we generate a d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
32
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
7
1

Relationship

4
4

Authors

Journals

citations
Cited by 29 publications
(32 citation statements)
references
References 51 publications
0
32
0
Order By: Relevance
“…Pooling samples before library preparations, also called "poolseq", can be used for projects with hundreds of samples and if tracing back individual samples is not relevant for the research question at hand (Himmelbach et al, 2014;Anand et al, 2016). This strategy is useful for the identification of variable regions between populations, especially when population sampling must be higher than what the budget allows for sequencing as individual libraries (Neethiraj et al, 2017). Because with this method it is possible to sample many individuals within a population, there is more information for detecting rare variants across the population.…”
Section: Amplificationmentioning
confidence: 99%
“…Pooling samples before library preparations, also called "poolseq", can be used for projects with hundreds of samples and if tracing back individual samples is not relevant for the research question at hand (Himmelbach et al, 2014;Anand et al, 2016). This strategy is useful for the identification of variable regions between populations, especially when population sampling must be higher than what the budget allows for sequencing as individual libraries (Neethiraj et al, 2017). Because with this method it is possible to sample many individuals within a population, there is more information for detecting rare variants across the population.…”
Section: Amplificationmentioning
confidence: 99%
“…As an alternative to using reference genomes of closely related species or for situations when no such reference is available, we here apply a newly developed exon mining via Pool‐seq approach to acquire a draft genome assembly both for the focal species and for single nucleotide polymorphism (SNP) frequency estimation. The approach leverages the power of Pool‐seq to subsample the genome to obtain high‐resolution genomic insights quickly and at a reasonable cost (Neethiraj, Hornett, Hill, & Wheat, ). This pipeline has been utilized for elucidating the genomics underlying phenotypic differences between populations of several butterfly species (Keehnen, Hill, Nylin, & Wheat, ; Pruisscher, Nylin, Gotthard, & Wheat, ; Woronik & Wheat, ).…”
Section: Introductionmentioning
confidence: 99%
“…Pool‐seq data from populations are mapped against the final gene‐models‐only genome assembly which contains both protein‐coding and noncoding sequences, for estimation of population diversity and differentiation (akin to mapping RNA‐seq data against a de novo transcriptome of the same data). This method (Neethiraj et al, ) has similarities with that of Therkildsen and Palumbi () who mapped Pool‐seq reads to a reference transcriptome. In our case, we generate a transcriptome from published RNA‐seq data and use it to scaffold and annotate a draft genome assembly from Pool‐seq data.…”
Section: Introductionmentioning
confidence: 99%
“…As this genome lacks an annotation, we generated a draft annotation using spaln v2.1.2 (Gotoh, ; Iwata & Gotoh, ), which is a high‐performance protein to genome alignment program that is exon boundary aware, and the Danaus plexippus protein set DPOGS2 (Zhan & Reppert, ). In addition, the MESPA pipeline (Neethiraj, Hornett, Hill, & Wheat, ), which uses spaln to construct gene models from highly fragmented genome assemblies, was used to identify the immune genes. A set of 225 immune proteins were used for annotation, with the majority ( N = 205) of these being the canonical set of immune genes identified in Bombyx mori (Lepidoptera, Bombycidae; Tanaka et al., ), and an additional 20 candidate genes being antimicrobial peptides identified in the closely related species Pieris rapae (Lepidoptera, Pieridae; C. Wheat unpublished data).…”
Section: Methodsmentioning
confidence: 99%