“…In fact, several new models for compound-kinase binding prediction are introduced every month [CCAS + 15, CRA + 21, DQJ + 22, DSSGP22]. They differ in the learning algorithm used, such as simple k-nearest neighbor regression [BHS + 21], decision trees [TAA + 22], kernel learning [MM12, NPC16, CRP + 17, CPS + 18] and deep learning methods [BHS + 21, O18, KZEK23, LLP23, SSB + 23], as well as compound and protein descriptors, including compound SMILES and graphs [DTME20], protein amino acid sequences [BHS + 21, KZEK23] and, lately, more complex 3D structure-based features [KZK + 23, PHL + 23, LKN + 23, LTZ + 23] and embeddings from pretrained large language models [SSB + 23]. Most recent methods modeling compound-kinase activities learn from the descriptors of both compounds and kinases, and are referred to as proteochemometric models.…”