2022
DOI: 10.1101/2022.03.14.484272
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Learning deep representations of enzyme thermal adaptation

Abstract: Temperature is a fundamental environmental factor that shapes the evolution of organisms. Learning thermal determinants of protein sequences in evolution thus has profound significance for basic biology, drug discovery, and protein engineering. Here, we use a dataset of over 3 million enzymes labeled with optimal growth temperatures (OGT) of their source organisms to train a deep neural network model (DeepET). The protein-temperature representations learned by DeepET provide a temperature-related statistical s… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 58 publications
(84 reference statements)
0
3
0
Order By: Relevance
“…These measurements showed that 78.0% (recall) of all labeled HTTPs were predicted as HTTPs by the model, and 86.0% (precision) of all sequences predicted as HTTPs by the model were actually the labeled HTTPs in the test set (Table 1 ). Unlike state-of-the-art methods in protein thermostability prediction that utilize OGT data, such as DeepET [ 25 ] or Tome [ 26 ], our model does not directly predict metrics related to thermostability. Instead, it classifies sequences exhibiting HTTP traits, subsequently leveraging this classification as a pression in a virtual evolutionary process.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…These measurements showed that 78.0% (recall) of all labeled HTTPs were predicted as HTTPs by the model, and 86.0% (precision) of all sequences predicted as HTTPs by the model were actually the labeled HTTPs in the test set (Table 1 ). Unlike state-of-the-art methods in protein thermostability prediction that utilize OGT data, such as DeepET [ 25 ] or Tome [ 26 ], our model does not directly predict metrics related to thermostability. Instead, it classifies sequences exhibiting HTTP traits, subsequently leveraging this classification as a pression in a virtual evolutionary process.…”
Section: Resultsmentioning
confidence: 99%
“…In contrast with conventional stability prediction methods relying on scarce high-resolution data, our technique leverages large amount of OGT data from genomic sequences across diverse organisms. Some prior efforts have utilized OGT data to achieve moderate protein thermostability discrimination [ 25 , 27 ]. For instance, TemStaPro [ 27 ] exhibits higher accuracy and recall than our Thermo-selector.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation