Precision medicine as a control problem: Using simulation and deep reinforcement learning to discover adaptive, personalized multi-cytokine therapy for sepsis

Petersen, Brenden K.; Yang, Jiachen; Grathwohl, Will; Cockrell, Chase; Santiago, Claudio; An, Gary; Faissol, Daniel M.

doi:10.48550/arxiv.1802.10440

Cited by 5 publications

(8 citation statements)

References 24 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The obtained results in [181] depict the superior performance of the proposed mixture model compared with applying the strategies of physicians, Kernel learning only and DRL only. In [182], the authors leverage DDPG scheme to deal with the continuous state and action spaces of the sepsis environment, hence defining an effective treatment strategy for sepsis.…”

Section: Remote Monitoring Applicationsmentioning

confidence: 99%

Reinforcement Learning for Intelligent Healthcare Systems: A Comprehensive Survey

Abdellatif¹,

Mhaisen²,

Chkirbene³

et al. 2021

Preprint

View full text Add to dashboard Cite

The rapid increase in the percentage of chronic disease patients along with the recent pandemic pose immediate threats on healthcare expenditure and elevate causes of death. This calls for transforming healthcare systems away from one-on-one patient treatment into intelligent health systems, to improve services, access and scalability, while reducing costs. Reinforcement Learning (RL) has witnessed an intrinsic breakthrough in solving a variety of complex problems for diverse applications and services. Thus, we conduct in this paper a comprehensive survey of the recent models and techniques of RL that have been developed/used for supporting Intelligent-healthcare (I-health) systems. This paper can guide the readers to deeply understand the state-of-theart regarding the use of RL in the context of I-health. Specifically, we first present an overview for the I-health systems challenges, architecture, and how RL can benefit these systems. We then review the background and mathematical modeling of different RL, Deep RL (DRL), and multiagent RL models. After that, we provide a deep literature review for the applications of RL in I-health systems. In particular, three main areas have been tackled, i.e., edge intelligence, smart core network, and dynamic treatment regimes. Finally, we highlight emerging challenges and outline future research directions in driving the future success of RL in I-health systems, which opens the door for exploring some interesting and unsolved problems.

show abstract

Section: Remote Monitoring Applicationsmentioning

confidence: 99%

Reinforcement Learning for Intelligent Healthcare Systems: A Comprehensive Survey

Abdellatif¹,

Mhaisen²,

Chkirbene³

et al. 2021

Preprint

View full text Add to dashboard Cite

show abstract

“…The majority of research using RL in healthcare is in dynamic treatment regimes, where the goal is to develop effective treatment regimes that can dynamically adapt to the varying clinical states and improve the long-term outcomes for patients (Yu et al, 2019b). This includes DTR for diseases such as cancer (Zhao, Kosorok, & Zeng, 2009;Liu, Logan, Liu, Xu, Tang, & Wang, 2017), diabetes (Daskalaki, Scarnato, Diem, & Mougiakakou, 2010;Bothe, Dickens, Reichel, Tellmann, Ellger, Westphal, & Faisal, 2013;Daskalaki, Diem, & Mougiakakou, 2013), anemia (Malof & Gaweda, 2011;Escandell-Montero, Chermisi, Martinez-Martinez, Gomez-Sanchis, Barbieri, Soria-Olivas, Mari, Vila-Francés, Stopper, Gatti, et al, 2014), HIV (Parbhoo, 2014;Parbhoo, Bogojeska, Zazzi, Roth, & Doshi-Velez, 2017;Yu, Dong, Liu, & Ren, 2019a), mental illnesses (Paredes, Gilad-Bachrach, Czerwinski, Roseway, Rowan, & Hernandez, 2014;Pineau, Guez, Vincent, Panuccio, & Avoli, 2009), and DTR in critical care (Weng, Gao, He, Yan, & Szolovits, 2017;Petersen, Yang, Grathwohl, Cockrell, Santiago, An, & Faissol, 2018).…”

Section: Related Workmentioning

confidence: 99%

An Analysis of Reinforcement Learning for Malaria Control

Makondo¹,

Folarin²,

Zitha³

et al. 2021

Preprint

View full text Add to dashboard Cite

Previous work on policy learning for Malaria control has often formulated the problem as an optimization problem assuming the objective function and the search space have a specific structure. The problem has been formulated as multi-armed bandits, contextual bandits and a Markov Decision Process in isolation. Furthermore, an emphasis is put on developing new algorithms specific to an instance of Malaria control, while ignoring a plethora of simpler and general algorithms in the literature. In this work, we formally study the formulation of Malaria control and present a comprehensive analysis of several formulations used in the literature. In addition, we implement and analyze several reinforcement learning algorithms in all formulations and compare them to black box optimization. In contrast to previous work, our results show that simple algorithms based on Upper Confidence Bounds are sufficient for learning good Malaria policies, and tend to outperform their more advanced counterparts on the malaria OpenAI Gym environment.

show abstract

“…Targeting at glycemic regulation problems for severely ill septic patients, Weng et al [204] applied PI to learn the optimal targeted blood glucose levels from real data trajectories. Petersen et al [205] investigated the cytokine mediation problem in sepsis treatment, using the DRL method, Deep Deterministic Policy Gradient (DDPG) [233], to tackle the hi-dimensional continuous states and actions, and potential-based reward shaping [234] to facilitate the learning efficiency. The proposed approach was evaluated using an agent-based model, the Innate Immune Response Agent-Based Model (IIRABM), that simulates the immune response to infection.…”

Section: B Critical Carementioning

confidence: 99%

Reinforcement Learning in Healthcare: A Survey

Yu¹,

Liu²,

Nemati³

2019

Preprint

View full text Add to dashboard Cite

As a subfield of machine learning, reinforcement learning (RL) aims at empowering one's capabilities in behavioural decision making by using interaction experience with the world and an evaluative feedback. Unlike traditional supervised learning methods that usually rely on one-shot, exhaustive and supervised reward signals, RL tackles with sequential decision making problems with sampled, evaluative and delayed feedback simultaneously. Such distinctive features make RL technique a suitable candidate for developing powerful solutions in a variety of healthcare domains, where diagnosing decisions or treatment regimes are usually characterized by a prolonged and sequential procedure. This survey will discuss the broad applications of RL techniques in healthcare domains, in order to provide the research community with systematic understanding of theoretical foundations, enabling methods and techniques, existing challenges, and new insights of this emerging paradigm. By first briefly examining theoretical foundations and key techniques in RL research from efficient and representational directions, we then provide an overview of RL applications in a variety of healthcare domains, ranging from dynamic treatment regimes in chronic diseases and critical care, automated medical diagnosis from both unstructured and structured clinical data, as well as many other control or scheduling domains that have infiltrated many aspects of a healthcare system. Finally, we summarize the challenges and open issues in current research, and point out some potential solutions and directions for future research.

show abstract

Precision medicine as a control problem: Using simulation and deep reinforcement learning to discover adaptive, personalized multi-cytokine therapy for sepsis

Cited by 5 publications

References 24 publications

Reinforcement Learning for Intelligent Healthcare Systems: A Comprehensive Survey

Reinforcement Learning for Intelligent Healthcare Systems: A Comprehensive Survey

An Analysis of Reinforcement Learning for Malaria Control

Reinforcement Learning in Healthcare: A Survey

Contact Info

Product

Resources

About