2020
DOI: 10.1109/tcomm.2020.2977895
|View full text |Cite
|
Sign up to set email alerts
|

A Reinforcement Learning-Based User-Assisted Caching Strategy for Dynamic Content Library in Small Cell Networks

Abstract: This paper studies the problem of joint edge cache placement and content delivery in cache-enabled small cell networks in the presence of spatio-temporal content dynamics unknown a priori. The small base stations (SBSs) satisfy users' content requests either directly from their local caches, or by retrieving from other SBSs' caches or from the content server. In contrast to previous approaches that assume a static content library at the server, this paper considers a more realistic nonstationary content librar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 36 publications
0
5
0
Order By: Relevance
“…Based on Proposition 4 and according to the value iteration algorithm, the optimal policy that solves (22) has a threshold structure given in Equation (23). Next, we show that the thresholds η m , ∀m are uniformly bounded.…”
Section: A Proof Of Theoremmentioning
confidence: 95%
“…Based on Proposition 4 and according to the value iteration algorithm, the optimal policy that solves (22) has a threshold structure given in Equation (23). Next, we show that the thresholds η m , ∀m are uniformly bounded.…”
Section: A Proof Of Theoremmentioning
confidence: 95%
“…The optimization of content caching and delivery policy under non-stationary content libraries through a user-assisted RL algorithm is the subject of [71]. The network utility consists of backhaul traffic offloading, cache hit rate, content retrieval and delivery.…”
Section: ) Caching Efficiencymentioning
confidence: 99%
“…Content caching can be applied in many different types of communication systems from cellular networks (e.g., [6]- [8]) to device-to-device networks (e.g., [9], [10]). Many researchers focus on investigating content caching styles including deterministic, probabilistic and coded caching.…”
Section: A Related Workmentioning
confidence: 99%
“…After updating the Q-value, the agent moves to the next state and executes the action which has been chosen earlier. The Q-value for a state-action is updated by 8) where ω represents the learning rate which determines to what extent the Q-value is updated based on the newly acquired information and γ is the discount factor.…”
Section: A Rl-based Cachingmentioning
confidence: 99%