Learning the language of viral evolution and escape

Hie, Brian; Zhong, Ellen D.; Berger, Bonnie; Bryson, Bryan

doi:10.1101/2020.07.08.193946

Cited by 7 publications

(5 citation statements)

References 54 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Convalescent plasma from individuals who have recovered from COVID-19 has also been granted EUA for hospitalized patients with COVID-19 but efficacy data from randomized trials are needed. And despite massive roll-out of vaccines against SARS-CoV-2, the rapid emergence of novel strains amongst other practical issues raises serious concerns on the longevity and efficacy of this approach to confer herd immunity (Hie et al, 2021). Hence, there is an urgent unmet clinical need for an arsenal of potent antiviral therapeutics that can reduce COVID-19 associated morbidity and mortality (McCreary and Angus, 2020; Thorp, 2020).…”

Section: Introductionmentioning

confidence: 99%

The Prolyl-tRNA Synthetase Inhibitor Halofuginone Inhibits SARS-CoV-2 Infection

Sandoval

Clausen

Nora

et al. 2021

Preprint

View full text Add to dashboard Cite

We identify the prolyl-tRNA synthetase (PRS) inhibitor halofuginone, a compound in clinical trials for anti-fibrotic and anti-inflammatory applications, as a potent inhibitor of SARS-CoV-2 infection and replication. The interaction of SARS-CoV-2 spike protein with cell surface heparan sulfate (HS) promotes viral entry. We find that halofuginone reduces HS biosynthesis, thereby reducing spike protein binding, SARS-CoV-2 pseudotyped virus, and authentic SARS-CoV-2 infection. Halofuginone also potently suppresses SARS-CoV-2 replication post-entry. Utilizing analogues of halofuginone and small molecule inhibitors of the PRS, we establish that inhibition of HS presentation and viral replication is dependent on proline tRNA synthesis opposed to PRS activation of the integrated stress response (ISR). Moreover, we provide evidence that these effects are mediated by the depletion of proline tRNAs. In line with this, we find that SARS-CoV-2 polyproteins, as well as several HS proteoglycans, are particularly proline-rich, which may make them vulnerable to halofuginone translational suppression. Halofuginone is orally bioavailable, has been evaluated in a phase I clinical trial in humans and distributes to SARS-CoV-2 target organs, including the lung, making it a promising clinical trial candidate for the treatment of COVID-19.

show abstract

Section: Introductionmentioning

confidence: 99%

The Prolyl-tRNA Synthetase Inhibitor Halofuginone Inhibits SARS-CoV-2 Infection

Sandoval

Clausen

Nora

et al. 2021

Preprint

View full text Add to dashboard Cite

show abstract

“…Language models have been used to understand and predict viral mutations that evade neutralizing antibodies [35] . Language generation models can also be applied to synthetic protein design, as in ProGen and other works [56] , [110] , [4] .…”

Section: Language Generationmentioning

confidence: 99%

The language of proteins: NLP, machine learning & protein sequences

Ofer

Brandes

Linial

2021

Computational and Structural Biotechnology Journal

256

178

View full text Add to dashboard Cite

Natural language processing (NLP) is a field of computer science concerned with automated text and language analysis. In recent years, following a series of breakthroughs in deep and machine learning, NLP methods have shown overwhelming progress. Here, we review the success, promise and pitfalls of applying NLP algorithms to the study of proteins. Proteins, which can be represented as strings of amino-acid letters, are a natural fit to many NLP methods. We explore the conceptual similarities and differences between proteins and language, and review a range of protein-related tasks amenable to machine learning. We present methods for encoding the information of proteins as text and analyzing it with NLP methods, reviewing classic concepts such as bag-of-words, k-mers/n-grams and text search, as well as modern techniques such as word embedding, contextualized embedding, deep learning and neural language models. In particular, we focus on recent innovations such as masked language modeling, self-supervised learning and attention-based models. Finally, we discuss trends and challenges in the intersection of NLP and protein research.

show abstract

“…If these and other methods can be applied to the problem of AAV capsid engineering, AAV variant sequences with similar properties to each other would be close together in latent space after being transformed into their latent representations, even if they are far apart in sequence space. A similar strategy was recently used to predict the emergence of escape mutations in multiple viruses ( 22 ).…”

Section: Key Concepts For Applying Machine Learning To Engineer Novel Capsidsmentioning

confidence: 99%

Overcoming Immunological Challenges Limiting Capsid-Mediated Gene Therapy With Machine Learning

Wec¹,

Lin²,

Kwasnieski³

et al. 2021

Front. Immunol.

View full text Add to dashboard Cite

A key hurdle to making adeno-associated virus (AAV) capsid mediated gene therapy broadly beneficial to all patients is overcoming pre-existing and therapy-induced immune responses to these vectors. Recent advances in high-throughput DNA synthesis, multiplexing and sequencing technologies have accelerated engineering of improved capsid properties such as production yield, packaging efficiency, biodistribution and transduction efficiency. Here we outline how machine learning, advances in viral immunology, and high-throughput measurements can enable engineering of a new generation of de-immunized capsids beyond the antigenic landscape of natural AAVs, towards expanding the therapeutic reach of gene therapy.

show abstract

Learning the language of viral evolution and escape

Cited by 7 publications

References 54 publications

The Prolyl-tRNA Synthetase Inhibitor Halofuginone Inhibits SARS-CoV-2 Infection

The Prolyl-tRNA Synthetase Inhibitor Halofuginone Inhibits SARS-CoV-2 Infection

The language of proteins: NLP, machine learning & protein sequences

Overcoming Immunological Challenges Limiting Capsid-Mediated Gene Therapy With Machine Learning

Contact Info

Product

Resources

About