2023
DOI: 10.1007/978-3-031-43421-1_27
|View full text |Cite
|
Sign up to set email alerts
|

Offline Reinforcement Learning with On-Policy Q-Function Regularization

Laixi Shi,
Robert Dadashi,
Yuejie Chi
et al.
Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 6 publications
0
1
0
Order By: Relevance
“…Lu et al (2022) focus on offline reinforcement learning and insist that data distribution is represented and handled more clearly in offline reinforcement learning. Shi et al (2022) insist on the use of learning a near-optimal policy using history data by the agent with offline or batch reinforcement learning. Wang and Wu (2022) discuss the perspective of research directions of blockchain in the domain of operations research.…”
Section: Related Workmentioning
confidence: 99%
“…Lu et al (2022) focus on offline reinforcement learning and insist that data distribution is represented and handled more clearly in offline reinforcement learning. Shi et al (2022) insist on the use of learning a near-optimal policy using history data by the agent with offline or batch reinforcement learning. Wang and Wu (2022) discuss the perspective of research directions of blockchain in the domain of operations research.…”
Section: Related Workmentioning
confidence: 99%