Democratizing protein language models with parameter-efficient fine-tuning

Sledzieski, Samuel; Kshirsagar, Meghana; Baek, Minkyung; Dodhia, Rahul; Lavista Ferres, Juan; Berger, Bonnie

doi:10.1073/pnas.2405840121

Cited by 10 publications

References 44 publications

Applicability of AlphaFold2 in the modeling of dimeric, trimeric, and tetrameric coiled‐coil domains

Madaj, Martinez‐Goikoetxea, Kaminski, et al. 2024. Protein Science.

Coiled coils are a common protein structural motif involved in cellular functions ranging from mediating protein–protein interactions to facilitating processes such as signal transduction or regulation of gene expression. They are formed by two or more alpha helices that wind around a central axis to form a buried hydrophobic core. Various forms of coiled‐coil bundles have been reported, each characterized by the number, orientation, and degree of winding of the constituent helices. This variability is underpinned by short sequence repeats that form coiled coils and whose properties determine both their overall topology and the local geometry of the hydrophobic core. The strikingly repetitive sequence has enabled the development of accurate sequence‐based coiled‐coil prediction methods; however, the modeling of coiled‐coil domains remains a challenging task. In this work, we evaluated the accuracy of AlphaFold2 in modeling coiled‐coil domains, both in modeling local geometry and in predicting global topological properties. Furthermore, we show that the prediction of the oligomeric state of coiled‐coil bundles can be achieved by using the internal representations of AlphaFold2, with a performance better than any previous state‐of‐the‐art method (code available at https://github.com/labstructbioinf/dc2_oligo).

Fine-tuning protein language models boosts predictions across diverse tasks

Schmirler, Heinzinger, Rost. 2024. Nat Commun.

Prediction methods that take embeddings from protein language models as input have reached or even surpassed state-of-the-art performance on many protein prediction tasks. In natural language processing, fine-tuning large language models has become the de facto standard. In contrast, most protein language model-based protein predictions do not back-propagate to the language model. Here, we compare the fine-tuning of three state-of-the-art models (ESM2, ProtT5, Ankh) on eight different tasks. Two results stand out. Firstly, task-specific supervised fine-tuning almost always improves downstream predictions. Secondly, parameter-efficient fine-tuning can reach similar improvements while consuming substantially fewer resources, at up to 4.5-fold acceleration of training over fine-tuning full models. Our results suggest always trying fine-tuning, in particular for problems with small datasets, such as fitness landscape predictions of a single protein. For ease of adaptability, we provide easy-to-use notebooks to fine-tune all models used during this work for per-protein (pooling) and per-residue prediction tasks.
