2019
DOI: 10.1101/522342
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Machine learning applied to predicting microorganism growth temperatures and enzyme catalytic optima

Abstract: Enzymes that catalyze chemical reactions at high temperatures are used for industrial biocatalysis, applications in molecular biology, and as highly evolvable starting points for protein engineering. The optimal growth temperature (OGT) of organisms is commonly used to estimate the stability of enzymes encoded in their genomes, but the number of experimentally determined OGT values are limited, particularly for thermophilic organisms. Here, we report on the development of a machine learning model that can accu… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 49 publications
0
2
0
Order By: Relevance
“…The GTDB framework also enables linking phenotypes to genomes, as well as multiple sequences from the same genome across alignments, which can aid studies of protein and RNA complexes. We used a machine learning approach 38 to assign an optimal growth temperature to each reference genome in the GTDB, building on experimental measurements 36,37 . We then tested whether these temperatures, assigned to the RNAs identified in the GTDB genomes, could be used to identify thermophilic mutations that stabilize the E. coli ribosome.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…The GTDB framework also enables linking phenotypes to genomes, as well as multiple sequences from the same genome across alignments, which can aid studies of protein and RNA complexes. We used a machine learning approach 38 to assign an optimal growth temperature to each reference genome in the GTDB, building on experimental measurements 36,37 . We then tested whether these temperatures, assigned to the RNAs identified in the GTDB genomes, could be used to identify thermophilic mutations that stabilize the E. coli ribosome.…”
Section: Discussionmentioning
confidence: 99%
“…However, since the TEMPURA and Gosha databases only include cultivated species, they only have experimental OGTs for 15% of the GTDB reference species. We therefore inferred OGTs of all GTDB reference genomes using TOME 38 . TOME predicts the OGT for an organism using a machine learning model trained on proteome-wide dipeptide (2-mer) distributions.…”
Section: Mapping Of Optimal Growth Temperatures To Gtdb Reference Gen...mentioning
confidence: 99%