2018
DOI: 10.1109/jstsp.2018.2798164
|View full text |Cite
|
Sign up to set email alerts
|

Generalized Global Bandit and Its Application in Cellular Coverage Optimization

Abstract: Motivated by the engineering problem of cellular coverage optimization, we propose a novel multiarmed bandit model called generalized global bandit. We develop a series of greedy algorithms that have the capability to handle nonmonotonic but decomposable reward functions, multidimensional global parameters, and switching costs. The proposed algorithms are rigorously analyzed under the multiarmed bandit framework, where we show that they achieve bounded regret, and hence, they are guaranteed to converge to the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
20
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 29 publications
(20 citation statements)
references
References 33 publications
0
20
0
Order By: Relevance
“…MAB theory has been widely applied in wireless networks, such as power allocation in small base stations [16] [17], content placement in edge caching [18], [19], task assignment in mobile crowdsourcing [20] and mobility management in mobile edge computing [21]. Very recently, the BA problem is studied based on MAB theory, which makes online decision to strike the balance between exploitation and exploration.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…MAB theory has been widely applied in wireless networks, such as power allocation in small base stations [16] [17], content placement in edge caching [18], [19], task assignment in mobile crowdsourcing [20] and mobility management in mobile edge computing [21]. Very recently, the BA problem is studied based on MAB theory, which makes online decision to strike the balance between exploitation and exploration.…”
Section: Related Workmentioning
confidence: 99%
“…Furthermore, the weak Lipschitz condition is mild, which only has the impact on the region in the vicinity of the optimal value. This assumption is well justified in many practical applications [17]. 2) (Well-shaped region) For a region, C h,j , of depth h, the region contains a ball with a radius of ρ 2 γ h which locates in the center of C h,j .…”
Section: Regret Performance Analysismentioning
confidence: 99%
“…Dandanov et al [14] proposed an RL method to optimize the network performance and discussed the effect of antenna down tilts on network coverage and capacity. Cong et al [15] presented a generalized global bandit method to solve the cellular coverage optimization problem.…”
Section: Related Workmentioning
confidence: 99%
“…Learning-based methods tend to solve the optimization problem by learning necessary information from available data or by interactions with the surrounding environments and make suitable decisions on future action based on some learned models or in a model-free manner [14], [15].…”
Section: Introductionmentioning
confidence: 99%
“…The structured bandit setting considered in this paper can be viewed as a hidden context problem, where the objective is to do targeted advertising for a user without observing their features, as illustrated in Figure 1. Apart from ad selection, the structured bandit model also has applications in dynamic pricing (described in [7]), cellular coverage optimization (by [8]), drug dosage optimization (discussed in [9]).…”
Section: Introductionmentioning
confidence: 99%