2017
DOI: 10.1049/iet-com.2017.0213

Optimised Q‐learning for WiFi offloading in dense cellular networks

Cited by 26 publications (24 citation statements) | References 15 publications
“…As can be seen from Figure 3, after filtering out in advance the invalid networks whose throughput is less than the threshold V_TP^th, the convergence speed of Q-learning can be greatly accelerated. Figures 4 and 5 compare this paper's algorithm, Fakhfakh and Hamouda's algorithm [11], and the RSS (received signal strength) algorithm in terms of user satisfaction, throughput, power consumption, cost, and delay under streaming service. We repeatedly scatter APs 1000 times to eliminate randomness.…”
Section: Numerical and Simulation Results
confidence: 99%
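To make the pre-filtering step concrete, here is a minimal Python sketch of the idea in the excerpt: candidate networks whose estimated throughput falls below the threshold V_TP^th are discarded before Q-learning starts, so the agent searches a smaller action set and converges faster. The Network class, the threshold value, and the function name are illustrative assumptions, not code from the cited paper.

```python
# Hypothetical sketch: drop candidate networks below the throughput threshold
# before Q-learning runs, shrinking the action space (names are assumptions).
from dataclasses import dataclass

V_TP_TH = 2.0  # assumed throughput threshold in Mbit/s


@dataclass
class Network:
    name: str
    throughput: float  # estimated achievable throughput in Mbit/s


def filter_valid_networks(candidates: list[Network],
                          threshold: float = V_TP_TH) -> list[Network]:
    """Keep only the networks whose throughput meets the threshold."""
    return [n for n in candidates if n.throughput >= threshold]


if __name__ == "__main__":
    candidates = [Network("AP-1", 5.4), Network("AP-2", 0.8), Network("LTE", 3.1)]
    actions = filter_valid_networks(candidates)
    print([n.name for n in actions])  # Q-learning then runs over this reduced set
```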
“…The number of user-passed positions N_p is equal to 10, and the number of WiFi APs is changed from 20 to 60. As can be seen from Figure 4, the WiFi offloading algorithm in this paper is superior to the other two algorithms in user satisfaction.…

Input: state set S, action set A, paired comparison matrix B, candidate network attribute matrix X, and iteration limit Z
Output: trained Q-table, best action selection strategy Π*, and user satisfaction Φ_sat,j
(1) Calculate attribute weights based on B
(2) For s ∈ S, a ∈ A
(3)     Q(s, a) ← 0
(4) End For
(5) Randomly choose s_ini ∈ S as the initialization state
(6) While iteration < Z
(7)     For each state
(8)         If rand < ε
(9)             Randomly choose an action
(10)        Else
(11)            Select the action corresponding to the maximum Q value in this state
(12)        End If
(13)        Perform a
(14)        Calculate Rw_t(s, a) according to equation (23)
(15)        Observe the next state s′
(16)        Update the Q-table …”
Section: Numerical and Simulation Results
confidence: 99%
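The quoted listing maps directly onto tabular Q-learning with an epsilon-greedy policy. Below is a minimal Python sketch of steps (2)-(16) under stated assumptions: the reward callback stands in for Rw_t(s, a) of equation (23), which is not reproduced in the excerpt, the hyper-parameters are placeholders, and the inner per-state loop is simplified to a single trajectory.

```python
# Minimal sketch of the quoted algorithm: epsilon-greedy tabular Q-learning
# over candidate networks. Hyper-parameters and the toy reward are assumptions.
import random
from collections import defaultdict

ALPHA, GAMMA, EPSILON, Z = 0.1, 0.9, 0.1, 1000  # assumed hyper-parameters


def q_learn(states, actions, reward, transition):
    """states/actions: lists; reward(s, a) -> float; transition(s, a) -> next state."""
    Q = defaultdict(float)                           # steps (2)-(4): Q(s, a) = 0
    s = random.choice(states)                        # step (5): random initial state
    for _ in range(Z):                               # step (6): iterate up to the limit Z
        if random.random() < EPSILON:                # steps (8)-(9): explore
            a = random.choice(actions)
        else:                                        # steps (10)-(11): exploit best known action
            a = max(actions, key=lambda x: Q[(s, x)])
        r = reward(s, a)                             # steps (13)-(14): act, get reward Rw_t(s, a)
        s_next = transition(s, a)                    # step (15): observe the next state
        best_next = max(Q[(s_next, x)] for x in actions)
        Q[(s, a)] += ALPHA * (r + GAMMA * best_next - Q[(s, a)])  # step (16): Q-table update
        s = s_next
    policy = {st: max(actions, key=lambda x: Q[(st, x)]) for st in states}
    return Q, policy


if __name__ == "__main__":
    # Toy usage: states are user positions, actions are candidate networks.
    states, actions = ["p0", "p1"], ["WiFi-AP", "LTE"]
    reward = lambda s, a: 1.0 if a == "WiFi-AP" else 0.3   # assumed reward values
    transition = lambda s, a: random.choice(states)
    _, policy = q_learn(states, actions, reward, transition)
    print(policy)
```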
“…Rashed et al. studied reinforcement learning that maximizes the sum-rate of D2D users and cellular users so as to minimize interference in a D2D environment [21]. Fakhfakh and Hamouda used the SINR received from the access point (AP) detected by the mobile user, together with QoS metrics on channel load and delay, as the reward for choosing WiFi over a cellular network, applying WiFi offloading to reduce the load on the cellular network [22]. Yan et al. proposed a smart aggregated radio access technologies (RAT) access strategy that aims to maximize long-term network throughput while meeting diverse traffic quality-of-service requirements, using Q-learning [23].…”
Section: Related Work
confidence: 99%
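As a rough illustration of the reward design attributed to Fakhfakh and Hamouda [22], the sketch below combines AP SINR with channel-load and delay QoS metrics into a single scalar reward for the WiFi-versus-cellular choice. The weights, normalisation ranges, and function name are assumptions for illustration only, not the formula used in [22].

```python
# Hedged illustration: a composite reward favouring WiFi offloading when SINR
# is high and channel load and delay are low. All constants are assumptions.
def offload_reward(sinr_db: float, channel_load: float, delay_ms: float,
                   w_sinr: float = 0.5, w_load: float = 0.3, w_delay: float = 0.2) -> float:
    """Higher SINR raises the reward; higher load and delay lower it."""
    sinr_term = min(max(sinr_db / 30.0, 0.0), 1.0)        # assumed 0-30 dB useful range
    load_term = 1.0 - min(max(channel_load, 0.0), 1.0)    # channel_load expected in [0, 1]
    delay_term = 1.0 - min(delay_ms / 100.0, 1.0)         # assumed 100 ms delay budget
    return w_sinr * sinr_term + w_load * load_term + w_delay * delay_term


# Example: a lightly loaded AP with good SINR yields a high reward for offloading.
print(round(offload_reward(sinr_db=22.0, channel_load=0.2, delay_ms=15.0), 3))
```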