2021
DOI: 10.21105/joss.03678
|View full text |Cite
|
Sign up to set email alerts
|

Opfi: A Python package for identifying gene clusters in large genomics and metagenomics data sets

Abstract: Gene clusters are sets of co-localized, often contiguous genes that together perform specific functions, many of which are relevant to biotechnology. There is a need for software tools that can extract candidate gene clusters from vast amounts of available genomic data. Therefore, we developed Opfi: a modular pipeline for identification of arbitrary gene clusters in assembled genomic or metagenomic sequences. Opfi contains functions for annotation, de-deduplication, and visualization of putative gene clusters.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 13 publications
0
1
0
Order By: Relevance
“…Genomes containing CAST systems were collected from NCBI genomic databases (Pruitt et al, 2005). We searched for CRISPR-Cas systems in these genomes using Opfi, a Python library to search DNA sequencing data for putative CRISPR systems (Hill et al, 2021). First, we located all regions containing a CRISPR array that was not associated with a CAST.…”
Section: Methodsmentioning
confidence: 99%
“…Genomes containing CAST systems were collected from NCBI genomic databases (Pruitt et al, 2005). We searched for CRISPR-Cas systems in these genomes using Opfi, a Python library to search DNA sequencing data for putative CRISPR systems (Hill et al, 2021). First, we located all regions containing a CRISPR array that was not associated with a CAST.…”
Section: Methodsmentioning
confidence: 99%