2020 3rd International Conference on Intelligent Sustainable Systems (ICISS), 2020
DOI: 10.1109/iciss49785.2020.9315959

An N-step Look Ahead Algorithm Using Mixed (On and Off) Policy Reinforcement Learning

citations
Cited by 1 publication
(1 citation statement)
references
References 3 publications
“…In deep reinforcement learning, the single-step average reward value of each episode is an important indicator to measure the training effect [30, 31, 32, 33]. This paper counts the average single-step rewards of [22] and DCPER-DDPG algorithm in 6000 episodes.…”
Section: Results Analysis (mentioning)
confidence: 99%