2010
DOI: 10.1109/tvt.2010.2048766
|View full text |Cite
|
Sign up to set email alerts
|

Opportunistic Bandwidth Sharing Through Reinforcement Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
27
0

Year Published

2012
2012
2020
2020

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 61 publications
(27 citation statements)
references
References 23 publications
0
27
0
Order By: Relevance
“…RL is an unsupervised and intelligent approach that enables an agent to observe and learn about the static or dynamic operating environment in the absence of guidance, feedback or the expected response from supervisors (or external critics), and subsequently make decisions on action selection in order to achieve optimal or near-optimal system performance. RL has been adopted in the literature [8][9][10][11][12][13][14][15][16] because it does not require prior knowledge of channel availability and it is highly adaptive to the dynamicity of channels characteristics. In addition, it enables decision makers (or agents) to learn and subsequently achieve near-optimal or optimal solutions in the dynamic environment that may be complex and large-scale in nature [7,8,16,17].…”
Section: Q4mentioning
confidence: 99%
“…RL is an unsupervised and intelligent approach that enables an agent to observe and learn about the static or dynamic operating environment in the absence of guidance, feedback or the expected response from supervisors (or external critics), and subsequently make decisions on action selection in order to achieve optimal or near-optimal system performance. RL has been adopted in the literature [8][9][10][11][12][13][14][15][16] because it does not require prior knowledge of channel availability and it is highly adaptive to the dynamicity of channels characteristics. In addition, it enables decision makers (or agents) to learn and subsequently achieve near-optimal or optimal solutions in the dynamic environment that may be complex and large-scale in nature [7,8,16,17].…”
Section: Q4mentioning
confidence: 99%
“…Reinforcement learning learns a policy from which most reward is obtained through a series of actions [17]. Reinforcement learning is a broad class of optimal control methods depending on estimating value functions from experience or simulations [18][19][20][21].…”
Section: The Reinforcement Learning Ramp Meteringmentioning
confidence: 99%
“…As examples, we can mention [49][50][51] for spectrum sensing, [52,53] for spectrum sharing and [54] for spectrum access and control. We can distinguish two types of learning algorithms generally used in this context: reinforcement learning and Q-learning.…”
Section: Learning Based Approachesmentioning
confidence: 99%