Polymer–protein hybrids are intriguing materials that can bolster protein stability in non‐native environments, thereby enhancing their utility in diverse medicinal, commercial, and industrial applications. One stabilization strategy involves designing synthetic random copolymers with compositions attuned to the protein surface, but rational design is complicated by the vast chemical and composition space. Here, a strategy is reported to design protein‐stabilizing copolymers based on active machine learning, facilitated by automated material synthesis and characterization platforms. The versatility and robustness of the approach is demonstrated by the successful identification of copolymers that preserve, or even enhance, the activity of three chemically distinct enzymes following exposure to thermal denaturing conditions. Although systematic screening results in mixed success, active learning appropriately identifies unique and effective copolymer chemistries for the stabilization of each enzyme. Overall, this work broadens the capabilities to design fit‐for‐purpose synthetic copolymers that promote or otherwise manipulate protein activity, with extensions toward the design of robust polymer–protein hybrid materials.
In this work, we present, evaluate, and analyze strategies for representing polymer chemistry to machine learning models for the advancement of data-driven sequence or composition design of macromolecules.
Among the many molecules that contribute to glial scarring, chondroitin sulfate proteoglycans (CSPGs) are known to be potent inhibitors of neuronal regeneration. Chondroitinase ABC (ChABC), a bacterial lyase, degrades the glycosaminoglycan (GAG) side chains of CSPGs and promotes tissue regeneration. However, ChABC is thermally unstable and loses all activity within a few hours at 37 °C under dilute conditions. To overcome this limitation, the discovery of a diverse set of tailor-made random copolymers that complex and stabilize ChABC at physiological temperature is reported. The copolymer designs, which are based on chain length and composition of the copolymers, are identified using an active machine learning paradigm, which involves iterative copolymer synthesis, testing for ChABC thermostability upon copolymer complexation, Gaussian process regression modeling, and Bayesian optimization. Copolymers are synthesized by automated PET-RAFT and thermostability of ChABC is assessed by retained enzyme activity (REA) after 24 h at 37 °C. Significant improvements in REA in three iterations of active learning are demonstrated while identifying exceptionally high-performing copolymers. Most remarkably, one designed copolymer promotes residual ChABC activity near 30%, even after one week and notably outperforms other common stabilization methods for ChABC. Together, these results highlight a promising pathway toward sustained tissue regeneration.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.