“…Previously, machine-learning methods in computational biology leveraged data-driven protein representations such as substitution matrices, capturing biophysical features (Henikoff & Henikoff, 1992), family-specific profiles (Stormo et al, 1982), or evolutionary couplings (Morcos et al, 2011) that capture evolutionary features. Now, embeddings provide competitive results for many prediction tasks (Littmann et al, 2021;Rao et al, 2019Rao et al, , 2020. Protein LMs may even be combined with other representations to gain even better performance (Rives et al, 2019;Villegas-Morcillo et al, 2020).…”