2016
DOI: 10.1016/j.sbi.2016.05.017
|View full text |Cite
|
Sign up to set email alerts
|

Novel function discovery through sequence and structural data mining

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
26
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 37 publications
(27 citation statements)
references
References 102 publications
(97 reference statements)
0
26
0
Order By: Relevance
“…Given the extreme sequence diversity of the hypervariable region and its potential as a scaffold for surface-localized functions, we hypothesized that there are flagellins with novel functionality that can be predicted from available bacterial genomes. Here, by computational identification of unexpected domain fusions 18 , we report the discovery and experimental validation of a family of enzymatically active flagellins present in the genomes of 74 bacterial species including the pathogenic clostridia Clostridium haemolyticum , and strains of Clostridium novyi and Clostridium botulinum . We find that these flagellins harbor a catalytically active zinc-metallopeptidase domain that is localized to extracellular flagellar filaments, so resulting in flagella-embedded protease activity in structures of up to 10 µm.…”
Section: Introductionmentioning
confidence: 99%
“…Given the extreme sequence diversity of the hypervariable region and its potential as a scaffold for surface-localized functions, we hypothesized that there are flagellins with novel functionality that can be predicted from available bacterial genomes. Here, by computational identification of unexpected domain fusions 18 , we report the discovery and experimental validation of a family of enzymatically active flagellins present in the genomes of 74 bacterial species including the pathogenic clostridia Clostridium haemolyticum , and strains of Clostridium novyi and Clostridium botulinum . We find that these flagellins harbor a catalytically active zinc-metallopeptidase domain that is localized to extracellular flagellar filaments, so resulting in flagella-embedded protease activity in structures of up to 10 µm.…”
Section: Introductionmentioning
confidence: 99%
“…Once the structure of a protein is known, residues critical for stability, as well as its active sites can often be located, and targeted mutations can be introduced. Similar structures can be identified and used to infer about the studied protein . Similarly, resolution of protein complexes can provide valuable information concerning the interaction mechanisms of proteins, allowing for the improved characterization and manipulation of signaling pathways in the cell.…”
Section: Introductionmentioning
confidence: 99%
“…Similar structures can be identified and used to infer about the studied protein. 1 Similarly, resolution of protein complexes can provide valuable information concerning the interaction mechanisms of proteins, allowing for the improved characterization and manipulation of signaling pathways in the cell. Once the details of the interface are known, inhibitors and activators of the target proteins can be designed based on features of the binding interface.…”
Section: Introductionmentioning
confidence: 99%
“…These characteristics suggest that FUnkFams are bona fide protein families, and the associations of specific FUnkFams with marine environments or body sites provide hints about protein function and ecology. This study therefore lays the groundwork for significant future work to (i) predict (e.g., via genome proximity and further metagenome profiling (Lobb and Doxey, 2016) or literature based similarity (Price and Arkin, 2017)) and (ii) experimentally validate (e.g., via biochemical and structural characterization (Gerlt et al, 2011)) the functions of FUnkFams and the unannotated protein domains they contain. Our approach can be flexibly extended to use other databases of gene families and sources of functional annotation, and it will be interesting to apply it to other protein catalogs as well as RNA genes.…”
Section: Figurementioning
confidence: 99%