2018
DOI: 10.1109/jstsp.2017.2787979
|View full text |Cite
|
Sign up to set email alerts
|

Optimal and Scalable Caching for 5G Using Reinforcement Learning of Space-Time Popularities

Abstract: Small basestations (SBs) equipped with caching units have potential to handle the unprecedented demand growth in heterogeneous networks. Through low-rate, backhaul connections with the backbone, SBs can prefetch popular files during off-peak traffic hours, and service them to the edge at peak periods. To intelligently prefetch, each SB must learn what and when to cache, while taking into account SB memory limitations, the massive number of available contents, the unknown popularity profiles, as well as the spa… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
182
1

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 236 publications
(183 citation statements)
references
References 36 publications
0
182
1
Order By: Relevance
“…The grouped linear model is introduced in [30] to obtain the predicted content requests, based on which the cache replacement is optimized by using RL with model-free acceleration. In [31], a RL framework is proposed to obtain the optimal caching strategy at SBSs taking into account the space-time dynamics of the content popularity. In [32], the probabilistic caching strategy, resource allocation, and computation offloading at fog nodes are jointly considered to minimize the average transmission delay exploiting deep RL.…”
Section: A Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The grouped linear model is introduced in [30] to obtain the predicted content requests, based on which the cache replacement is optimized by using RL with model-free acceleration. In [31], a RL framework is proposed to obtain the optimal caching strategy at SBSs taking into account the space-time dynamics of the content popularity. In [32], the probabilistic caching strategy, resource allocation, and computation offloading at fog nodes are jointly considered to minimize the average transmission delay exploiting deep RL.…”
Section: A Related Workmentioning
confidence: 99%
“…According to [44], the skewness of the Zipf-like distribution is dependent on the specific application. In brief, Zipf-like distribution is able to well depict the real user request distribution of various networks and thus is widely adopted in [18], [21], [25], [26], [31]. In this paper, we consider the mobile UDN and thus Zipf-like distribution can be safely used to model the user request distribution.…”
Section: B Content Popularity Profilementioning
confidence: 99%
“…Instead of fitting models, another option is to learn the popularity without using prior assumptions [21], [22]. For instance, [22] models the popularity evolution as a Markov process and employs Q-learning to estimate the transition probabilities which are then used for proactive caching. Such model-free solutions work well if there are adequate data, but have substantial computation and memory requirements.…”
Section: A Reactive Policiesmentioning
confidence: 99%
“…In particular, researchers are working on Mobile Edge Computing (MEC) [94] and on what, where, when, and how to log cache contents. In this context, coding [95] and machine learning [96] techniques can be applied. Examples of typical scenarios are vehicular communications [97] and low-latency applications [98].…”
Section: Cachingmentioning
confidence: 99%