Preparing for the next COVID: Deep Reinforcement Learning trained Artificial Intelligence discovery of multi-modal immunomodulatory control of systemic inflammation in the absence of effective anti-microbials

Larie, Dale; An, Gary; Cockrell, Chase

doi:10.1101/2022.02.17.480940

Cited by 6 publications

(23 citation statements)

References 32 publications


“…The ability to manipulate any combination of mediators present is meant to simulate the potential use of combinations of interventions, which our prior work has suggested is necessary to effectively control sepsis (15)(16)(17); the DRL approach is intended to assist in addressing the exponential combinatorial issues associated with multi-drug therapy and the additional challenge needing to modify a particular treatment application to account for the temporal heterogeneity among individuals with regards to their disease trajectories.…”

Section: Initial and Termination Conditions (mentioning)

confidence: 99%

“…We have previously reported on the challenges present in attempting to control sepsis using anti-cytokine/anti-mediator therapies, primarily stemming from the failures to recognize the dynamic complexity of the mechanistic processes ostensibly being targeted (12) and that in order to be effective the treatment of sepsis should be considered a complex control problem (13). In previous work we have shown that sepsis is potentially controllable by discovering multi-modal control strategies using different types of machine learning (ML) methods trained on a complex agent-based model of acute systemic inflammation (the Innate Immune Response Agent-based Model, or IIRABM (14)) (15)(16)(17). Specifically, the latter projects described in Refs (16,17) utilized the method, Deep Reinforcement Learning (DRL), employed by ML/Artificial Intelligence (AI) systems to successfully play and win a series of games against human experts (18)(19)(20)(21)(22).…”

Section: Introduction (mentioning)

confidence: 99%

“…In previous work we have shown that sepsis is potentially controllable by discovering multi-modal control strategies using different types of machine learning (ML) methods trained on a complex agent-based model of acute systemic inflammation (the Innate Immune Response Agent-based Model, or IIRABM (14)) (15)(16)(17). Specifically, the latter projects described in Refs (16,17) utilized the method, Deep Reinforcement Learning (DRL), employed by ML/Artificial Intelligence (AI) systems to successfully play and win a series of games against human experts (18)(19)(20)(21)(22). We term this approach simulation-based model-free DRL, and in prior work applied to method where we treated the attempt to control sepsis as a "game" to be played using the IIRABM, where potential cytokine interventions represented the "moves" implemented by the AI agent (16,17).…”

Section: Introduction (mentioning)

confidence: 99%

“…Specifically, the latter projects described in Refs (16,17) utilized the method, Deep Reinforcement Learning (DRL), employed by ML/Artificial Intelligence (AI) systems to successfully play and win a series of games against human experts (18)(19)(20)(21)(22). We term this approach simulation-based model-free DRL, and in prior work applied to method where we treated the attempt to control sepsis as a "game" to be played using the IIRABM, where potential cytokine interventions represented the "moves" implemented by the AI agent (16,17).…”

Section: Introduction (mentioning)

confidence: 99%

“…We propose that simulation-based control discovery using DRL can provide useful insights and potentially critical capabilities in designing effective multi-modal and adaptive immunomodulatory therapies for infections for which no effective anti-microbial agents exist. We have previously demonstrated in a proof-of-concept report that such a control policy can be discovered with DRL when manipulating up to 11 different mediators and soluble factors every 6 minutes (17). We now extend that study to evaluate whether DRL can train an artificial neural network (ANN) to discover a treatment policy utilizing existing anti-cytokine drugs to improve the outcomes to simulated infection in the absence of anti-microbial treatment.…”

Section: Introduction (mentioning)

confidence: 99%

Preparing for the next Pandemic: Simulation-based Deep Reinforcement Learning to discover and test multimodal control of systemic inflammation using repurposed immunomodulatory agents

Cockrell

Larie

2022

Preprint

Self Cite

Background: Preparation to address the critical gap in a future pandemic between non-pharmacological measures and the deployment of new drugs/vaccines requires addressing two factors: 1) finding virus/pathogen-agnostic pathophysiological targets to mitigate disease severity and 2) finding a more rational approach to repurposing existing drugs. It is increasingly recognized that acute viral disease severity is heavily driven by the immune response to the infection (cytokine storm). There exist numerous clinically available biologics that suppress various pro-inflammatory cytokines/mediators, but it is extremely difficult to identify clinically effective treatment regimens with these agents. We propose that this is a complex control problem that resists standard methods of developing treatment regimens and accomplishing this goal requires the application of simulation-based, model-free deep reinforcement learning (DRL) in a fashion akin to training successful game-playing artificial intelligences (AIs). This proof-of-concept study determines if simulated sepsis (e.g. infection-driven cytokine storm) can be controlled in the absence of effective antimicrobial agents by targeting cytokines for which FDA-approved biologics currently exist. Methods: We use a previously validated agent-based model, the Innate Immune Response Agent-based Model (IIRABM), for control discovery using DRL. DRL training used a Deep Deterministic Policy Gradient (DDPG) approach with a clinically plausible control interval of 6 hours with manipulation of six cytokines for which there are existing drugs: Tumor Necrosis Factor (TNF), Interleukin-1 (IL-1), Interleukin-4 (IL-4), Interleukin-8 (IL-8), Interleukin-12 (IL-12) and Interferon-γ (IFNg). Results: DRL trained an AI policy that could improve outcomes from a baseline mortality rate of 41% (= recovery rate of 59%) to one with a recovery rate of 82.3% over 42 days simulated time. 
Discussion: The current proof-of-concept study demonstrates that significant disease severity mitigation can potentially be accomplished with existing anti-mediator drugs, but only through a multi-modal, adaptive treatment policy requiring implementation with an AI. While the actual clinical implementation of this approach is a projection for the future, the current goal of this work is to inspire the development of a research ecosystem that marries what is needed to improve the simulation models with the development of the sensing/assay technologies to collect the data needed to iteratively refine those models.

Chemical structures of cyclic ADP ribose (cADPR) isomers and the molecular basis of their production and signaling

Manik

Shi

et al. 2022

Preprint

Cyclic ADP ribose (cADPR) isomers are important signaling molecules produced by bacterial and plant Toll/interleukin-1 receptor (TIR) domains via NAD+ hydrolysis, yet their chemical structures are unknown. We show that v-cADPR (2’cADPR) and v2-cADPR (3’cADPR) isomers are cyclized by O-glycosidic bond formation between the ribose moieties in ADPR. Structures of v-cADPR (2’cADPR)-producing TIR domains reveal that conformational changes are required for the formation of the active assembly that resembles those of Toll-like receptor adaptor TIR domains, and mutagenesis data demonstrate that a conserved tryptophan is essential for cyclization. We show that v2-cADPR (3’cADPR) is a potent activator of ThsA effector proteins from Thoeris anti-phage defence systems and is responsible for suppression of plant immunity by the effector HopAM1. Collectively, our results define new enzymatic activities of TIR domains, reveal the molecular basis of cADPR isomer production, and establish v2-cADPR (3’cADPR) as an antiviral signaling molecule and an effector-mediated signaling molecule for plant immunity suppression.
One-Sentence Summary: The chemical structures of two O-glycosidic bond-containing cyclic ADP ribose isomers, the molecular basis of their production, and their function in antiviral and plant immunity suppression by bacteria are reported.

Preparing for the next pandemic: Simulation-based deep reinforcement learning to discover and test multimodal control of systemic inflammation using repurposed immunomodulatory agents

Cockrell

Larie

2022

Front. Immunol.

Self Cite

Background: Preparation to address the critical gap in a future pandemic between non-pharmacological measures and the deployment of new drugs/vaccines requires addressing two factors: 1) finding virus/pathogen-agnostic pathophysiological targets to mitigate disease severity and 2) finding a more rational approach to repurposing existing drugs. It is increasingly recognized that acute viral disease severity is heavily driven by the immune response to the infection (“cytokine storm” or “cytokine release syndrome”). There exist numerous clinically available biologics that suppress various pro-inflammatory cytokines/mediators, but it is extremely difficult to identify clinically effective treatment regimens with these agents. We propose that this is a complex control problem that resists standard methods of developing treatment regimens and accomplishing this goal requires the application of simulation-based, model-free deep reinforcement learning (DRL) in a fashion akin to training successful game-playing artificial intelligences (AIs). This proof-of-concept study determines if simulated sepsis (e.g. infection-driven cytokine storm) can be controlled in the absence of effective antimicrobial agents by targeting cytokines for which FDA-approved biologics currently exist. Methods: We use a previously validated agent-based model, the Innate Immune Response Agent-based Model (IIRABM), for control discovery using DRL. DRL training used a Deep Deterministic Policy Gradient (DDPG) approach with a clinically plausible control interval of 6 hours with manipulation of six cytokines for which there are existing drugs: Tumor Necrosis Factor (TNF), Interleukin-1 (IL-1), Interleukin-4 (IL-4), Interleukin-8 (IL-8), Interleukin-12 (IL-12) and Interferon-γ (IFNg). Results: DRL trained an AI policy that could improve outcomes from a baseline Recovered Rate of 61% to one with a Recovered Rate of 90% over ~21 days simulated time. This DRL policy was then tested on four different parameterizations not seen in training representing a range of host and microbe characteristics, demonstrating a range of improvement in Recovered Rate by +33% to +56%.
Discussion: The current proof-of-concept study demonstrates that significant disease severity mitigation can potentially be accomplished with existing anti-mediator drugs, but only through a multi-modal, adaptive treatment policy requiring implementation with an AI. While the actual clinical implementation of this approach is a projection for the future, the current goal of this work is to inspire the development of a research ecosystem that marries what is needed to improve the simulation models with the development of the sensing/assay technologies to collect the data needed to iteratively refine those models.
