2016
DOI: 10.1007/s00438-016-1225-9
|View full text |Cite
|
Sign up to set email alerts
|

Methods for accurate quantification of LTR-retrotransposon copy number using short-read sequence data: a case study in Sorghum

Abstract: Transposable elements (TEs) are ubiquitous in eukaryotic genomes and their mobility impacts genome structure and function in myriad ways. Because of their abundance, activity, and repetitive nature, the characterization and analysis of TEs remain challenging, particularly from short-read sequencing projects. To overcome this difficulty, we have developed a method that estimates TE copy number from short-read sequences. To test the accuracy of our method, we first performed an in silico analysis of the referenc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 67 publications
(69 reference statements)
0
4
0
Order By: Relevance
“…Interestingly, most of the software used to identify TEs requires assembled sequences as input, even though assembly algorithms have trouble with highly repetitive sections of genomes [4,66,203,204]. Repeats cause branches in graphs used in assembly algorithms (which can be one of two classes: overlap-based and De Bruijn graph) [205], leading assemblers to create false joins and wrong copy numbers, or even break graphs at these branch points, generating an accurate but fragmented assembly [205].…”
Section: How To Identify and Classify Retrotransposonsmentioning
confidence: 99%
“…Interestingly, most of the software used to identify TEs requires assembled sequences as input, even though assembly algorithms have trouble with highly repetitive sections of genomes [4,66,203,204]. Repeats cause branches in graphs used in assembly algorithms (which can be one of two classes: overlap-based and De Bruijn graph) [205], leading assemblers to create false joins and wrong copy numbers, or even break graphs at these branch points, generating an accurate but fragmented assembly [205].…”
Section: How To Identify and Classify Retrotransposonsmentioning
confidence: 99%
“…reporting the number of reads per cluster, which can then be used to estimate the genome space occupied by each particular repeat, i.e., (total length of each cluster (in Mb) x genome size (in Mb)) / total length of all clusters (in Mb) (Macas et al 2015, Kelly et al 2015, Ramachandran et al 2016. For species-specific clustering, three million reads (approximately 0.2x to 0.5x genome coverage) were sub-sampled from each dataset and processed to the format required by RepeatExplorer.…”
Section: Species-specific Clustering Analysis Provides Information Rementioning
confidence: 99%
“…The maize TE annotation file (B73v4.TE.filtered.gff3) was downloaded from Gramene Release 62 (http://www.gramene.org) and sorghum retroelement annotations were retrieved from our previous work described in Ramachandran et al 2016. BEDtools closest (version 2.17.0; Quinlan and Hall 2010) was used to identify LTR-retrotransposon (gypsy and copia) insertions within 1-5 kb upstream of genes.…”
Section: Genomic Distribution Of Retroelementsmentioning
confidence: 99%
“…reporting the number of reads per cluster, which can then be used to estimate the genome space occupied by each particular repeat, i.e., (total length of each cluster (in Mb) x genome size (in Mb)) / total length of all clusters (in Mb) (Kelly et al 2015, Ramachandran et al 2016. For species-specific clustering, three million reads (approximately 0.2x to 0.5x genome coverage) were sub-sampled from each dataset and processed to the format required by RepeatExplorer.…”
Section: Species-specific Clustering Analysis Provides Information Rementioning
confidence: 99%