2021
DOI: 10.1111/1755-0998.13474
|View full text |Cite
|
Sign up to set email alerts
|

Haploid, diploid, and pooled exome capture recapitulate features of biology and paralogy in two non‐model tree species

Abstract: Despite their suitability for studying evolution, many conifer species have large and repetitive giga‐genomes (16–31 Gbp) that create hurdles to producing high coverage SNP data sets that capture diversity from across the entirety of the genome. Due in part to multiple ancient whole genome duplication events, gene family expansion and subsequent evolution within Pinaceae, false diversity from the misalignment of paralog copies creates further challenges in accurately and reproducibly inferring evolutionary his… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
25
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3

Relationship

5
1

Authors

Journals

citations
Cited by 7 publications
(25 citation statements)
references
References 77 publications
0
25
0
Order By: Relevance
“…Due to the substantial size and complexity of conifer genomes, exome sequence capture is the preferred genotyping method for the taxon (Lind et al, 2022). We designed our capture probes based on previous Pinus contorta exome capture probes (Suren et al, 2016) and removed all probes which failed to effectively capture the target sequences.…”
Section: Probe Design and Sequence Capturementioning
confidence: 99%
See 1 more Smart Citation
“…Due to the substantial size and complexity of conifer genomes, exome sequence capture is the preferred genotyping method for the taxon (Lind et al, 2022). We designed our capture probes based on previous Pinus contorta exome capture probes (Suren et al, 2016) and removed all probes which failed to effectively capture the target sequences.…”
Section: Probe Design and Sequence Capturementioning
confidence: 99%
“…For instance, conifers often have exceptionally large genomes (20–40 Gbp; Neale et al, 2017) with histories of whole‐genome duplication (Zheng et al, 2015), gene family expansion (Scott et al, 2020; De La Torre et al, 2014), transposable element dynamics (Scott et al, 2020; Wang et al, 2020; Yi et al, 2018), and extensive repeat regions (Wegrzyn et al, 2014). These complexities present a major challenge for NGS data analysis and downstream hypothesis testing in conifers (Lind et al, 2022; Shu & Moran, 2020). Such challenges can be alleviated by quantifying the accuracy of SNP calling when using the above‐mentioned variant calling tools.…”
Section: Introductionmentioning
confidence: 99%
“…We used a pool-sequencing approach that targeted exon regions. For probe design details see (Lind et al, 2022). Briefly, the sequence capture probes were designed using genes identified in Douglas-fir RNA-seq data from (1) daily and cyclic induced experiments (Cronn et al, 2017) and (2…”
Section: Probe Design and Sequencingmentioning
confidence: 99%
“…As in (Lind et al, 2022), we used the VarScan pipeline (Lind, 2021a) to process our genomic data. Raw sequence reads were trimmed with fastp (v0.19.5, Chen et al 2018) by trimming reads that did not pass quality filters of less than twenty N's, a minimum mean Phred quality score of 30 for sliding windows of five base pairs, and a final length of 75 base pairs with no more than 20 base pairs called as N (-n 20 -M 30 -W 5 -l 75 -g -3).…”
Section: Mapping and Snp Callingmentioning
confidence: 99%
See 1 more Smart Citation