2020
DOI: 10.3389/fbioe.2020.00285
|View full text |Cite
|
Sign up to set email alerts
|

A Method for Prediction of Thermophilic Protein Based on Reduced Amino Acids and Mixed Features

Abstract: The thermostability of proteins is a key factor considered during enzyme engineering, and finding a method that can identify thermophilic and non-thermophilic proteins will be helpful for enzyme design. In this study, we established a novel method combining mixed features and machine learning to achieve this recognition task. In this method, an amino acid reduction scheme was adopted to recode the amino acid sequence. Then, the physicochemical characteristics, auto-cross covariance (ACC), and reduced dipeptide… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
29
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 31 publications
(29 citation statements)
references
References 96 publications
(92 reference statements)
0
29
0
Order By: Relevance
“…When g = 0, the 0‐gap dipeptide of a protein can be defined as an amino acid pair without any other amino acid between them. The whole extraction process is similar to the work of Feng [17] and Tang, [35] and can be described as follows: …”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…When g = 0, the 0‐gap dipeptide of a protein can be defined as an amino acid pair without any other amino acid between them. The whole extraction process is similar to the work of Feng [17] and Tang, [35] and can be described as follows: …”
Section: Methodsmentioning
confidence: 99%
“…They constructed a new dataset using the data extracted from the Universal Protein Resource (UniProt) database [15] and calculated the mixed features, including the amino acid composition and dipeptide composition. After the feature selection step, an optimized feature set was processed by the naïve Bayes (NB [16, 17]) method, whose predictive accuracy of 66.88% suggests that the proposed method is an effective method for antioxidant protein identification. Later, Feng et al.…”
Section: Introductionmentioning
confidence: 99%
“…Below, we propose some example applications: First, we can now sort genomes or homologous gene sequences by their optimal growth temperature (and not just by phylogenetic origin), making it possible to explore universal strategies of adaptations to heat and cold, as opposed to idiosyncratic adaptations within each lineage of species. Such studies would have far-reaching implications in biotechnology as they can simplify the rational design of biological molecules and organisms with a desired thermal stability 33,61,68,7075 . Second, this database simplifies studies of molecular adaptations to temperature within any given range of temperatures. This is important, because to date most studies have focused on extremophiles 49,50 , leaving mostly unexplored how mesophilic organisms adapt to relatively subtle changes in the environment (e.g.…”
Section: Applicationsmentioning
confidence: 99%
“…First, we can now sort genomes or homologous gene sequences by their optimal growth temperature (and not just by phylogenetic origin), making it possible to explore universal strategies of adaptations to heat and cold, as opposed to idiosyncratic adaptations within each lineage of species. Such studies would have far-reaching implications in biotechnology as they can simplify the rational design of biological molecules and organisms with a desired thermal stability 33,61,68,7075 .…”
Section: Applicationsmentioning
confidence: 99%
“…Several computational efforts based on machine learning (ML) methods have been made in recent years to identify TPPs 20 , 21 , 24 33 as summarized in Table 1 . As can be seen from Table 1 , support vector machine (SVM) method is the most widely used technique for identifying TPPs 20 , 21 , 24 26 , 28 30 . For instance, Zhang and Fan 31 developed the first TPP predictor based on amino acid composition (AAC) descriptors.…”
Section: Introductionmentioning
confidence: 99%