Personalized Dynamic Treatment Regimes in Continuous Time: A Bayesian Approach for Optimizing Clinical Decisions with Timing

Hua, William; Mei, Hongyuan; Zohar, Sarah; Giral, Magali; Xu, Yanxun

doi:10.1214/21-ba1276

Cited by 6 publications

(5 citation statements)

References 53 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In our case, for example, the problem is somewhat simplified; we have discretized time, and we have not taken into account mortality and morbidity in the reward. This has allowed us to avoid methodology needed for survival analysis 71 and continuous time, 72 which could be the subject of future work. Note also that reward function learning is an active area of research 8,73 …”

Section: Discussionmentioning

confidence: 99%

Relative sparsity for medical decision problems

Weisenthal

Thurston

Ertefaie

2023

Statistics in Medicine

View full text Add to dashboard Cite

Existing statistical methods can estimate a policy, or a mapping from covariates to decisions, which can then instruct decision makers (eg, whether to administer hypotension treatment based on covariates blood pressure and heart rate). There is great interest in using such data‐driven policies in healthcare. However, it is often important to explain to the healthcare provider, and to the patient, how a new policy differs from the current standard of care. This end is facilitated if one can pinpoint the aspects of the policy (ie, the parameters for blood pressure and heart rate) that change when moving from the standard of care to the new, suggested policy. To this end, we adapt ideas from Trust Region Policy Optimization (TRPO). In our work, however, unlike in TRPO, the difference between the suggested policy and standard of care is required to be sparse, aiding with interpretability. This yields “relative sparsity,” where, as a function of a tuning parameter, λ$$ \lambda $$, we can approximately control the number of parameters in our suggested policy that differ from their counterparts in the standard of care (eg, heart rate only). We propose a criterion for selecting λ$$ \lambda $$, perform simulations, and illustrate our method with a real, observational healthcare dataset, deriving a policy that is easy to explain in the context of the current standard of care. Our work promotes the adoption of data‐driven decision aids, which have great potential to improve health outcomes.

show abstract

Section: Discussionmentioning

confidence: 99%

Relative sparsity for medical decision problems

Weisenthal

Thurston

Ertefaie

2023

Statistics in Medicine

View full text Add to dashboard Cite

show abstract

“…[ 10 ] proposed a machine learning model based on the SOFA score for the prediction of mortality in critically ill patients. [ 38 ] developed a two-step Bayesian approach to optimize clinical decisions on timing, and the result shows that the proposed model are clinically useful to improve the survival of patients. The model of the research can be extended to other severity scoring systems, such as SOFA, GCS, and CT.…”

Section: Conclusion and Discussionmentioning

confidence: 99%

Mortality prediction in ICU Using a Stacked Ensemble Model

Ren

Zhao

Zhang

2022

Computational and Mathematical Methods in Medicine

View full text Add to dashboard Cite

Artificial intelligence (AI) technology has huge scope in developing models to predict the survival rate of critically ill patients in the intensive care unit (ICU). The availability of electronic clinical data has led to the widespread use of various machine learning approaches in this field. Innovative algorithms play a crucial role in boosting the performance of models. This study uses a stacked ensemble model to predict mortality in ICU by incorporating the clinical severity scoring results, in which several machine learning algorithms are employed to compare the performance. The experimental results show that the stacked ensemble model achieves good performance compared with the model without integrating the severity scoring results, which has the area under curve (AUC) of 0.879 and 0.862, respectively. To improve the performance of prediction, two feature subsets are obtained based on different feature selection techniques, labeled as SetS and SetT. Evaluation performances show that the SEM based on the SetS achieves a higher AUC value (0.879 and 0.860). Finally, the SHapley Additive exPlanations (SHAP) analysis is employed to interpret the correlation between the risk features and the outcome.

show abstract

“…All technical proofs are provided in the Supplementary Material. In addition, in the Supplementary Material, we demonstrate the usefulness of the proposed methods by conducting a simulation study, in which we simulate a synthetic dataset that mimics a real-world electronic medical record dataset for kidney transplantation patients (Hua et al, 2021).…”

Section: Data Coveragementioning

confidence: 99%

“…We demonstrate the usefulness of the proposed methods by conducting a simulation study, in which we simulate a synthetic dataset that mimics a real-world electronic medical record dataset for kidney transplantation patients (Hua et al, 2021). Kidney transplantation is the primary treatment for patients with chronic kidney disease or end-stage renal disease (Arshad et al, 2019).…”

Section: H Numerical Simulationmentioning

confidence: 99%

Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes

Fu¹,

Qi²,

Wang³

et al. 2022

Preprint

Self Cite

View full text Add to dashboard Cite

We study the offline reinforcement learning (RL) in the face of unmeasured confounders. Due to the lack of online interaction with the environment, offline RL is facing the following two significant challenges: (i) the agent may be confounded by the unobserved state variables; (ii) the offline data collected a prior does not provide sufficient coverage for the environment. To tackle the above challenges, we study the policy learning in the confounded MDPs with the aid of instrumental variables. Specifically, we first establish value function (VF)-based and marginalized importance sampling (MIS)-based identification results for the expected total reward in the confounded MDPs. Then by leveraging pessimism and our identification results, we propose various policy learning methods with the finite-sample suboptimality guarantee of finding the optimal in-class policy under minimal data coverage and modeling assumptions. Lastly, our extensive theoretical investigations and one numerical study motivated by the kidney transplantation demonstrate the promising performance of the proposed methods.

show abstract

Personalized Dynamic Treatment Regimes in Continuous Time: A Bayesian Approach for Optimizing Clinical Decisions with Timing

Cited by 6 publications

References 53 publications

Relative sparsity for medical decision problems

Relative sparsity for medical decision problems

Mortality prediction in ICU Using a Stacked Ensemble Model

Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes

Contact Info

Product

Resources

About