2018
DOI: 10.1093/bioinformatics/bty437
|View full text |Cite
|
Sign up to set email alerts
|

Motif scraper: a cross-platform, open-source tool for identifying degenerate nucleotide motif matches in FASTA files

Abstract: The motif scraper package (MIT license) is available via PyPI, and the Python source is available on GitHub at https://github.com/RobersonLab/motif_scraper.

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
3
1

Relationship

2
5

Authors

Journals

citations
Cited by 9 publications
(5 citation statements)
references
References 29 publications
0
5
0
Order By: Relevance
“…I calculated the GC content of each genome using a Python script with the pyfaidx package (v0.5.5.2) [8]. I then identified the potential CasX editing sites in each genome using Motif Scraper (v1.0.2) with the motif TTCNNNNNNNNNNNNNNNNNNNNN and multiple cores in file buffered mode [9]. The standard Motif Scraper options buffer all sites in memory for sorting later based on the order of contigs in the input FASTA file.…”
Section: Construction and Contentmentioning
confidence: 99%
“…I calculated the GC content of each genome using a Python script with the pyfaidx package (v0.5.5.2) [8]. I then identified the potential CasX editing sites in each genome using Motif Scraper (v1.0.2) with the motif TTCNNNNNNNNNNNNNNNNNNNNN and multiple cores in file buffered mode [9]. The standard Motif Scraper options buffer all sites in memory for sorting later based on the order of contigs in the input FASTA file.…”
Section: Construction and Contentmentioning
confidence: 99%
“…I calculated the GC content of each genome using a Python script with the pyfaidx package (v0.5.5.2) (Shirley et al 2015). I then identified the potential CasX editing sites in each genome using Motif Scraper (v1.0.2) with the motif TTCNNNNNNNNNNNNNNNNNNNNN and multiple cores in file buffered mode (Roberson 2018). The standard Motif Scraper options buffer all sites in memory for sorting later based on the order of contigs in the input FASTA file.…”
Section: Identification Of Casx Sitesmentioning
confidence: 99%
“…Typically, motifs conform to a particular sequence pattern, where certain positions can be constrained to a specific amino acid, whereas others are not [27]. This confers a high degeneration of the motifs yielding a huge list of non-redundant motif sequences and consequently, some motifs that are not characteristics of effector sequences only [28]. Furthermore, different amino acids (AAs) can have similar physicochemical properties, thus different motif sequences can share similar properties.…”
Section: Introductionmentioning
confidence: 99%