2022
DOI: 10.1016/j.is.2021.101878
|View full text |Cite
|
Sign up to set email alerts
|

Electronic health records based reinforcement learning for treatment optimizing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
37
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 37 publications
(37 citation statements)
references
References 32 publications
0
37
0
Order By: Relevance
“…The actor algorithm refers to the strategy function π θ ( s , a ), i.e., learning a strategy to get a higher return. The critic algorithm is represented via the value function V φ ( s ), which estimates the value function of the current strategy, i.e., evaluating how good the actor algorithm is [ 34 ]. The AC method may update the parameters in a single step rather than having to repeat the process at the conclusion of each round thanks to the value function.…”
Section: Models and Evaluation Methodsmentioning
confidence: 99%
“…The actor algorithm refers to the strategy function π θ ( s , a ), i.e., learning a strategy to get a higher return. The critic algorithm is represented via the value function V φ ( s ), which estimates the value function of the current strategy, i.e., evaluating how good the actor algorithm is [ 34 ]. The AC method may update the parameters in a single step rather than having to repeat the process at the conclusion of each round thanks to the value function.…”
Section: Models and Evaluation Methodsmentioning
confidence: 99%
“…Shi et al presented an offline q-learning approach trained on the OhioT1DM dataset for selecting discrete doses of basal insulin at hourly intervals [37]. Similarly, Li et al used a model-based RL algorithm to learn the blood glucose dynamics of patients recovering from diabetic ketoacidosis [26]. This learned model was then used to train an online q-learning algorithm to select the optimal basal dose at three hour intervals over a 24 hour period.…”
Section: Related Workmentioning
confidence: 99%
“…Li et al [12] utilized an optimization approach for reinforcement learning on the Electronic Health Records (EHR) for the treatment. Reinforcement learning provided an efficient path for providing a decision sequentially.…”
Section: The Contribution Of the Research Work Is As Followsmentioning
confidence: 99%