7th IEEE International Conference on Computer and Information Technology (CIT 2007) 2007
DOI: 10.1109/cit.2007.131
|View full text |Cite
|
Sign up to set email alerts
|

Performance Evaluation of TD-Learning Methods for Bandwidth Provisioning

Abstract: Q-learning and SARSA are two methods of TDlearning. Researchers interested in this field proposed the Eligibility concept in order to speed up Q-learning and SARSA. They proved their claim by running the algorithms in a static environment. Authors of this paper have used Q-learning, SARSA and also their eligibility versions for bandwidth provisioning in DiffServ networks that is an absolutely dynamic environment. Performance of these methods in this absolutely dynamic environment is evaluated.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 24 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?