2019
DOI: 10.1109/jsac.2019.2904366
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement Learning for Real-Time Optimization in NB-IoT Networks

Abstract: NarrowBand-Internet of Things (NB-IoT) is an emerging cellular-based technology that offers a range of flexible configurations for massive IoT radio access from groups of devices with heterogeneous requirements. A configuration specifies the amount of radio resource allocated to each group of devices for random access and for data transmission. Assuming no knowledge of the traffic statistics, there exists an important challenge in "how to determine the configuration that maximizes the long-term average number … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
64
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 97 publications
(64 citation statements)
references
References 27 publications
0
64
0
Order By: Relevance
“…Neural networks (NN) with multiple layers cannot explain the essential features that influence actions, nor the impact of data bias on the uncertainty of outputs. Beyond supervised learning for PHY layer signal detection, DNN is especially opaque when coupled with reinforcement learning (RL) [3], where the Markov Decision Process (MDP) is integrated with hidden layer dynamics. As such, there is the need to develop explainable algorithms that can quantify uncertainty, especially mapping data inputs, algorithm design, to the projected wireless key performance indicators (KPI).…”
Section: A Ai and Trustmentioning
confidence: 99%
“…Neural networks (NN) with multiple layers cannot explain the essential features that influence actions, nor the impact of data bias on the uncertainty of outputs. Beyond supervised learning for PHY layer signal detection, DNN is especially opaque when coupled with reinforcement learning (RL) [3], where the Markov Decision Process (MDP) is integrated with hidden layer dynamics. As such, there is the need to develop explainable algorithms that can quantify uncertainty, especially mapping data inputs, algorithm design, to the projected wireless key performance indicators (KPI).…”
Section: A Ai and Trustmentioning
confidence: 99%
“…In [6], [7], [8], a detailed explanation of NB-IoT design and physical layer procedures has been presented. In [9], a reinforcement learning-based framework to configure resources optimally for uplink in NB-IoT has been presented. In [10], authors have proposed an uplink link-adaptation scheme for the IoT devices in an NB-IoT network.…”
Section: Related Work a On Performance Evaluation Of Control Chamentioning
confidence: 99%
“…(i) With α = 0, the proposed scheduler in (9) becomes an objective function that tries to schedule more number of devices in a search space. Thus, the number of devices scheduled will always be greater than or equal to that of the other schedulers.…”
Section: Analysis Of the Proposed Schedulermentioning
confidence: 99%
“…Jiang et al address the problem of resource allocation in NB-IoT in their paper entitled "Reinforcement learning for real-time optimization in NB-IoT networks". They provide several proposals using reinforcement learning to maximize the number of served IoT devices in dynamic environments [14]. In the paper, the authors show that their approach requires less training time while the improvement of their algorithm using cooperative multi-agent learning outperforms other methods and converges to efficient solutions even at large scale.…”
Section: B Predictive Ai In Communications Networkmentioning
confidence: 99%