IEEE INFOCOM 2018 - IEEE Conference on Computer Communications 2018
DOI: 10.1109/infocom.2018.8485937
|View full text |Cite
|
Sign up to set email alerts
|

Low-Complexity Learning for Dynamic Spectrum Access in Multi-User Multi-Channel Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
2
2

Relationship

2
6

Authors

Journals

citations
Cited by 16 publications
(8 citation statements)
references
References 20 publications
0
8
0
Order By: Relevance
“…We model each update interval as a time unit of learning and consider the MAB reward as the average cost during the interval. In particular, we use the Upper Confidence Bound (UCB) index [18,19], which is known to achieve the asymptotically optimal performance in the MAB problem, and apply it to our problem to balance the exploration-and-exploitation trade-off in finding the best threshold. We verify the performance of our learning-based γ * -threshold through simulations.…”
Section: Finding the Optimal-threshold Under Unknown Costsmentioning
confidence: 99%
“…We model each update interval as a time unit of learning and consider the MAB reward as the average cost during the interval. In particular, we use the Upper Confidence Bound (UCB) index [18,19], which is known to achieve the asymptotically optimal performance in the MAB problem, and apply it to our problem to balance the exploration-and-exploitation trade-off in finding the best threshold. We verify the performance of our learning-based γ * -threshold through simulations.…”
Section: Finding the Optimal-threshold Under Unknown Costsmentioning
confidence: 99%
“…The lemma shows that the augmentation algorithm may still work well, even when the true weight is replaced with the UCB index. The proof of the lemma is analogous to Lemma A.1 of [29] but has some differences due to 'near-optimality'. It can be found in [30].…”
Section: A Regret Performance In a Single Framementioning
confidence: 99%
“…The authors of [28] have developed fully distributed schemes that can achieve the logarithmic regret without any information exchange. In [29], the authors have successfully lowered the algorithmic complexity to O(1) while achieving the logarithmic regret performance. Although these aforementioned learning algorithms are amenable to distributed implementation, they are limited to single-hop networks, such as wireless access networks, and to saturated traffic scenarios (i.e., links always have packets to send), and thus cannot accordingly respond to packet arrival dynamics.…”
Section: Introductionmentioning
confidence: 99%
“…In this subsection, we aim to solve P2. First, we can leverage the Lyapunov technique and transform constraint C4 into queue stability constraints [30], [31]. In detail, we introduce virtual queues 𝑄 𝑖 with the following update equation:…”
Section: B Virtual Queue and One-slot Optimization Problemmentioning
confidence: 99%