2021
DOI: 10.1371/journal.pcbi.1009449
|View full text |Cite
|
Sign up to set email alerts
|

Estimating repeat spectra and genome length from low-coverage genome skims with RESPECT

Abstract: The cost of sequencing the genome is dropping at a much faster rate compared to assembling and finishing the genome. The use of lightly sampled genomes (genome-skims) could be transformative for genomic ecology, and results using k-mers have shown the advantage of this approach in identification and phylogenetic placement of eukaryotic species. Here, we revisit the basic question of estimating genomic parameters such as genome length, coverage, and repeat structure, focusing specifically on estimating the k-me… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
19
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 22 publications
(19 citation statements)
references
References 27 publications
0
19
0
Order By: Relevance
“…To understand how LTRs may be related to species divergence within Oreohelix, we compared repeat content estimated from genome skims, LTR insertion times, and Oreohelix species divergence estimates from a previous study [ 16 ] . Analysis of genome skims using RESPECT v. 1.0.0 [ 36 ] revealed O. idahoensis contains roughly 2–3 fold of high copy repetitive k-mers per million bases compared to a closely related smooth, non-limestone endemic O. s. strigosa and a distantly related smooth, non-limestone species O. jugalis (Table 3 ) . We estimated LTR insertion times by comparing divergence between LTR sister pairs using a previously established molluscan substitution rate (see Materials and Methods).…”
Section: Resultsmentioning
confidence: 99%
“…To understand how LTRs may be related to species divergence within Oreohelix, we compared repeat content estimated from genome skims, LTR insertion times, and Oreohelix species divergence estimates from a previous study [ 16 ] . Analysis of genome skims using RESPECT v. 1.0.0 [ 36 ] revealed O. idahoensis contains roughly 2–3 fold of high copy repetitive k-mers per million bases compared to a closely related smooth, non-limestone endemic O. s. strigosa and a distantly related smooth, non-limestone species O. jugalis (Table 3 ) . We estimated LTR insertion times by comparing divergence between LTR sister pairs using a previously established molluscan substitution rate (see Materials and Methods).…”
Section: Resultsmentioning
confidence: 99%
“…Another possibility is that our sequencing data did not contain a faithful representation of the genome contents of our samples due to some intrinsic bias in the library preparation or sequencing technology (Pfeiffer et al, 2018). It is also notable that different k-mer-based tools produce different genome size estimates, suggesting that some models are more accurate than others (Melsted and Halldórsson, 2014;Sarmashghi et al, 2021). Finally, there is also error associated with genome size estimates made with flow cytometry, most notably that certain dyes will bind to particular sequence motifs (Doležel et al, 1998), and that sizing is made indirectly relative to a reference standard (which is also subject to associated error).…”
Section: Measuring Genome Size Differences With K-mersmentioning
confidence: 99%
“…To understand how LTRs may be related to species divergence within Oreohelix, we compared repeat content estimated from genome skims, LTR insertion times, and Oreohelix species divergence estimates from a previous study [14]. Analysis of genome skims using RESPECT v. 1.0.0 [34] revealed O. idahoensis contains roughly 2-3 fold of high copy repetitive k-mers per million bases compared to a closely related smooth, non-limestone endemic O. s. strigosa and a distantly related smooth, non-limestone species O. jugalis (Table 3). We estimated LTR insertion times by comparing divergence between LTR sister pairs using a previously established molluscan substitution rate (see Materials and Methods).…”
Section: Transposable Element Content and Evolutionmentioning
confidence: 99%
“…Finally, we examined genome size estimates of the closely related O. s. strigosa and distantly related O. jugalis by analyzing k-mer counts of genome skims using RESPECT v.1.0.0 [34]. Genome size estimates were roughly 1.5 times greater for O. idahoensis genome skims compared to either smooth form species (~ 9 Gb :~6 Gb; Table 3).…”
Section: Genome Size and Ltr Expansionmentioning
confidence: 99%
See 1 more Smart Citation