2019
DOI: 10.1186/s12864-019-5516-5

Patterns of microsatellite distribution across eukaryotic genomes

Abstract: Background: Microsatellites, or Simple Sequence Repeats (SSRs), are short tandem repeats of 1–6 nt motifs present in all genomes. Emerging evidence points to their role in cellular processes and gene regulation. Despite the huge resource of genomic information currently available, SSRs have been studied in a limited context and compared across relatively few species. Results: We have identified ~685 million eukaryotic microsatellites and analyzed their genomic trends acr…

Cited by 91 publications (111 citation statements)
References: 32 publications
“…The 55 complete genomes of Coronaviridae families were retrieved on March 23, 2020 (see Supplementary Material 1, Sheet 2 for more details; we used only RefSeq Nucleotides with complete annotations) and were scanned in search of SSRs using a Python package, PERF [2]. A minimum length of SSRs was chosen to be 12 nt [24, 34], which represents at least two complete repeating units of a 6-mer motif (hexamer). We used all theoretically possible 501 unique classes of SSRs as described in a study [34, 35] to identify their presence/absence in Coronaviridae genomes by using the following command: “PERF -i sequence.fasta -a -o sequence_perf_default.tsv”.…”
Section: Methods (mentioning)
confidence: 99%
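The figure of 501 unique SSR classes quoted above can be reproduced with a short enumeration. The sketch below assumes the usual convention that two motifs belong to the same class when one is a cyclic rotation of the other or of its reverse complement, and that motifs which are themselves repeats of a shorter unit (for example ATAT) are counted under the shorter motif. The function names are illustrative and are not taken from PERF.

```python
from itertools import product

# Translation table for complementing a DNA motif.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def is_primitive(motif):
    """True if the motif is not a repetition of a shorter unit (ATAT is not primitive)."""
    k = len(motif)
    return not any(
        k % d == 0 and motif[:d] * (k // d) == motif for d in range(1, k)
    )


def canonical(motif):
    """Smallest string among all rotations of the motif and of its reverse complement."""
    rev_comp = motif.translate(COMPLEMENT)[::-1]
    rotations = [m[i:] + m[:i] for m in (motif, rev_comp) for i in range(len(m))]
    return min(rotations)


def count_classes(max_len=6):
    """Number of unique repeat classes for each motif length from 1 to max_len."""
    counts = {}
    for k in range(1, max_len + 1):
        motifs = ("".join(p) for p in product("ACGT", repeat=k))
        counts[k] = len({canonical(m) for m in motifs if is_primitive(m)})
    return counts


if __name__ == "__main__":
    counts = count_classes()
    print(counts)                # per-length class counts
    print(sum(counts.values()))  # total number of classes for 1-6 nt motifs
```

Under these assumptions the per-length counts are 2, 4, 10, 33, 102 and 350 for motif lengths 1 to 6, which sum to 501.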
“…A minimum length of SSRs was chosen to be 12 nt [24, 34], which represents at least two complete repeating units of a 6-mer motif (hexamer). We used all theoretically possible 501 unique classes of SSRs as described in a study [34, 35] to identify their presence/absence in Coronaviridae genomes by using the following command: “PERF -i sequence.fasta -a -o sequence_perf_default.tsv”. The interactive.…”
Section: Methods (mentioning)
confidence: 99%
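To make the 12 nt cutoff concrete, the following is a minimal sketch of a perfect-repeat scanner for motifs of 1–6 nt. It is not PERF's implementation: the function names, the half-open coordinate convention, and the choice to count a trailing partial repeat unit towards the length are all assumptions made for illustration.

```python
def is_primitive(motif):
    """True if the motif is not a repetition of a shorter unit."""
    k = len(motif)
    return not any(
        k % d == 0 and motif[:d] * (k // d) == motif for d in range(1, k)
    )


def find_ssrs(seq, min_len=12, max_motif=6):
    """Return (start, end, motif) for perfect repeats of at least min_len nt.

    Coordinates are 0-based and half-open. For each motif length k, the
    sequence is split into maximal runs in which every base equals the base
    k positions earlier; such a run is a perfect repeat of its first k bases.
    """
    seq = seq.upper()
    n = len(seq)
    hits = []
    for k in range(1, max_motif + 1):
        start = 0  # start of the current run with period k
        for j in range(k, n + 1):
            if j < n and seq[j] == seq[j - k]:
                continue  # the run extends through position j
            motif = seq[start:start + k]
            # Skip runs that are too short, contain ambiguous bases, or whose
            # motif is a repeat of a shorter unit (found at the smaller k).
            if (
                j - start >= min_len
                and set(motif) <= set("ACGT")
                and is_primitive(motif)
            ):
                hits.append((start, j, motif))
            start = j - k + 1  # earliest position a new run can begin
    return sorted(hits)


if __name__ == "__main__":
    # Six AT units (12 nt) flanked by non-repetitive bases.
    print(find_ssrs("CCG" + "AT" * 6 + "CCG"))
    # Exactly two units of a hexamer also reach the 12 nt cutoff.
    print(find_ssrs("ACGTTGACGTTG"))
```

With a 12 nt minimum, a hexamer qualifies with exactly two units, while a mononucleotide run needs twelve, which is why the cutoff is described as "at least two complete repeating units" of a 6-mer.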
“…We set L max as the longest length of micro-satellites. The lengths of micro-satellites are generally less than 50kbps [27]. Thus, L max is set to be 50kbps.…”
Section: Sb-read (mentioning)
confidence: 99%
“…Further studies showed the genomic distribution of SSRs is nonrandom. SSRs in genes may influence gene transcription or translation and gene activity [6,23], and recent studies showed a higher abundance of SSRs in response to environmental stress [24]. The polymorphism levels and potential functions of SSRs differ among the 5′ untranslated region (5′ UTR), the 3′ UTR and coding sequences (CDS); SSRs in the 5′ UTR may affect transcription or translation, SSRs in the CDS may inactivate or activate genes, or truncate proteins, and SSRs in the 3′ UTR may cause silencing or slippage [25].…”
Section: Introduction (mentioning)
confidence: 99%