2019
DOI: 10.1186/s12911-019-0763-6
|View full text |Cite
|
Sign up to set email alerts
|

Inverse reinforcement learning for intelligent mechanical ventilation and sedative dosing in intensive care units

Abstract: Background Reinforcement learning (RL) provides a promising technique to solve complex sequential decision making problems in health care domains. To ensure such applications, an explicit reward function encoding domain knowledge should be specified beforehand to indicate the goal of tasks. However, there is usually no explicit information regarding the reward function in medical records. It is then necessary to consider an approach whereby the reward function can be learned from a set of presumab… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
38
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 51 publications
(39 citation statements)
references
References 15 publications
1
38
0
Order By: Relevance
“…Nemati et al [18] used deep RL methods to calculate optimal unfractionated Heparin from sub-optimal clinical ICU data. Yu et al [19] used inverse RL to infer the reward functions when dealing with mechanical ventilation and sedative dosing in ICUs. Chang et al [20] proposed a Q-learning method that jointly minimized the measurement cost and maximized predictive gain, by scheduling strategically-timed measurements in ICUs.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Nemati et al [18] used deep RL methods to calculate optimal unfractionated Heparin from sub-optimal clinical ICU data. Yu et al [19] used inverse RL to infer the reward functions when dealing with mechanical ventilation and sedative dosing in ICUs. Chang et al [20] proposed a Q-learning method that jointly minimized the measurement cost and maximized predictive gain, by scheduling strategically-timed measurements in ICUs.…”
Section: Related Workmentioning
confidence: 99%
“…After preprocessing, we take 10 minutes as the frequency of time series from admission time to discharge time. Please refer to [19] for more details in data processing.…”
Section: Data Processingmentioning
confidence: 99%
See 1 more Smart Citation
“…Patients in critical settings need sophisticated high-tech equipment, and specialized clinicians continuously monitor them. The patient safety and patient outcomes are more vital in such settings than in other units [13,[17][18][19]. The literature review has recognized the gap in NI competency required for nurses in different work environments and critical care settings.…”
Section: Introductionmentioning
confidence: 99%
“…Patients in critical settings need sophisticated hightech equipment, and specialized clinicians continuously monitor them. The patient safety and patient outcomes are more vital in such settings than in other units [13,[17][18][19]. The literature review has recognized the gap in NI competency required for nurses in different work environments and critical care settings.…”
Section: Introductionmentioning
confidence: 99%