2020
DOI: 10.1186/s12859-020-03794-x
|View full text |Cite
|
Sign up to set email alerts
|

Implementation of homology based and non-homology based computational methods for the identification and annotation of orphan enzymes: using Mycobacterium tuberculosis H37Rv as a case study

Abstract: Background Homology based methods are one of the most important and widely used approaches for functional annotation of high-throughput microbial genome data. A major limitation of these methods is the absence of well-characterized sequences for certain functions. The non-homology methods based on the context and the interactions of a protein are very useful for identifying missing metabolic activities and functional annotation in the absence of significant sequence similarity. In the current w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 58 publications
(84 reference statements)
1
4
0
Order By: Relevance
“…Of the 12 genes, one shows evidence of having been acquired by horizontal gene transfer, and it was only possible to predict two putative functional protein domains in the others, illustrating the challenge of working with novel genes. The overall lack of functional annotation for the identified genes reflects the limitation of homology-based algorithms for genome annotation [54], especially when it comes to species phylogenetically distant from the extensively annotated S. cerevisiae [55]. This matches the experience of previous studies with K. marxianus, Y. lipolytica and Lachancea kluyveri [37,56].…”
Section: Discussionsupporting
confidence: 78%
“…Of the 12 genes, one shows evidence of having been acquired by horizontal gene transfer, and it was only possible to predict two putative functional protein domains in the others, illustrating the challenge of working with novel genes. The overall lack of functional annotation for the identified genes reflects the limitation of homology-based algorithms for genome annotation [54], especially when it comes to species phylogenetically distant from the extensively annotated S. cerevisiae [55]. This matches the experience of previous studies with K. marxianus, Y. lipolytica and Lachancea kluyveri [37,56].…”
Section: Discussionsupporting
confidence: 78%
“… [34] ). Nonetheless, given the challenges associated with the annotation of “unknown” or “orphan” molecules, ML and structure-based tools (established recently; [89] , [90] , [91] , [92] ) appeared to provide a useful alternative to sequence-based methods, although often their performance had not been thoroughly evaluated individually or in combination [93] . To address this issue, we evaluated individual tools and optimised the cut-off values for select algorithms (cf.…”
Section: Discussionmentioning
confidence: 99%
“…We additionally applied gene/protein enrichment and network analysis to the set of the above six mutated genes by making use of the STRING tool. This kind of analysis was shown to be useful in recent studies aimed to decipher the emergence of drug resistance, and to elucidate the potential gene network and signaling pathways in pulmonary TB through gene/protein network analysis [22][23][24].…”
Section: Gene/protein Enrichment and Network Analysismentioning
confidence: 99%