2013 IEEE International Conference on Systems, Man, and Cybernetics 2013
DOI: 10.1109/smc.2013.135

A Predictive Q-Learning Algorithm for Deflection Routing in Buffer-less Networks

Abstract: In this paper, we introduce a predictive Q-learning deflection routing (PQDR) algorithm for buffer-less networks. Q-learning, one of the reinforcement learning (RL) algorithms, has been considered for routing in computer networks. RL-based algorithms have not been widely deployed in computer networks, where their inherent randomness is undesirable. However, this randomness is desirable in certain cases such as deflection routing, which may be employed to ameliorate packet loss caused by content…

Citations: cited by 9 publications (12 citation statements)
References: 13 publications
“…Reinforcement learning, which has been implemented in the proposed NN-NDD and ENN-NDD algorithms, provides a systematic framework for processing the gathered information. Various other deflection routing protocols based on reinforcement learning [8]-[10], [30] employ the Q-learning algorithm or its variants.…”
Section: Deflection Routing By Reinforcement Learning (mentioning)
confidence: 99%
“…Q-learning-based deflection routing algorithms do not provide a procedure to reselect the paths that have low Q-values as a consequence of transient network conditions. The PQDR algorithm [10] enables a node to recover and reselect such paths and improve its decision-making ability. The PQDR algorithm combines the Predictive Q-routing algorithm [3] and RLDRS to optimally deflect contending flows.…”
Section: Deflection Routing By Reinforcement Learning (mentioning)
confidence: 99%
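
The recovery idea in the statement above can be illustrated with a small sketch. This is not the PQDR algorithm as published: the class `PortEstimate`, the functions `predicted_value`, `choose_port`, and `update`, the linear recovery rule, and all numeric values are hypothetical and chosen only for illustration. The sketch assumes that a higher Q-value means a better output port, and that a port's value drifts back toward its best remembered value while the port is unused, so that a port penalized under transient congestion can eventually be reselected.

```python
from dataclasses import dataclass


@dataclass
class PortEstimate:
    q: float            # current learned value of the port (higher is better; assumption)
    best: float         # highest value ever learned for this port
    recovery: float     # hypothetical recovery rate (>= 0) applied per unit of idle time
    last_update: float  # time of the most recent update for this port


def predicted_value(est: PortEstimate, now: float) -> float:
    """Value used for decisions: q recovers linearly toward best, never above it."""
    idle = now - est.last_update
    return min(est.q + idle * est.recovery, est.best)


def choose_port(estimates: dict, now: float) -> str:
    """Pick the output port with the highest predicted value."""
    return max(estimates, key=lambda port: predicted_value(estimates[port], now))


def update(est: PortEstimate, reward: float, now: float, alpha: float = 0.5) -> None:
    """Standard incremental value update, plus bookkeeping for recovery."""
    est.q += alpha * (reward - est.q)
    est.best = max(est.best, est.q)
    est.last_update = now


# Port "A" was penalized by a transient condition; port "B" is steady.
ports = {
    "A": PortEstimate(q=1.0, best=8.0, recovery=1.0, last_update=0.0),
    "B": PortEstimate(q=5.0, best=5.0, recovery=0.0, last_update=0.0),
}
early_choice = choose_port(ports, now=1.0)  # A has barely recovered yet
late_choice = choose_port(ports, now=6.0)   # A has recovered past B
```

With a plain Q-learning rule and no recovery term, port "A" would keep its low value until it happened to be tried again; the recovery term is what lets the node reconsider it after enough idle time.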