Recent progress in
engineering highly promising biocatalysts
has
increasingly involved machine learning methods. These methods leverage
existing experimental and simulation data to aid in the discovery
and annotation of promising enzymes, as well as in suggesting beneficial
mutations for improving known targets. The field of machine learning
for protein engineering is gathering steam, driven by recent success
stories and notable progress in other areas. It already encompasses
ambitious tasks such as understanding and predicting protein structure
and function, catalytic efficiency, enantioselectivity, protein dynamics,
stability, solubility, aggregation, and more. Nonetheless, the field
is still evolving, with many challenges to overcome and questions
to address. In this Perspective, we provide an overview of ongoing
trends in this domain, highlight recent case studies, and examine
the current limitations of machine learning-based methods. We emphasize
the crucial importance of thorough experimental validation of emerging
models before their use for rational protein design. We present our
opinions on the fundamental problems and outline the potential directions
for future research.