2021
DOI: 10.1109/twc.2021.3066458
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Agent Reinforcement Learning for Cooperative Coded Caching via Homotopy Optimization

Abstract: Introducing cooperative coded caching into small cell networks is a promising approach to reducing traffic loads. By encoding content via maximum distance separable (MDS) codes, coded fragments can be collectively cached at small-cell base stations (SBSs) to enhance caching efficiency. However, content popularity is usually time-varying and unknown in practice. As a result, cached content is anticipated to be intelligently updated by taking into account limited caching storage and interactive impacts among SBS… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
20
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 32 publications
(20 citation statements)
references
References 33 publications
0
20
0
Order By: Relevance
“…The goal of the critic network is to minimize the loss L θ Q , since a smaller L θ Q represents a more accurate value given for states and actions. Therefore, the parameter θ Q of the evaluation network is updated by the gradient descent method, and the equation is shown as (4).…”
Section: Ddpg Learning Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The goal of the critic network is to minimize the loss L θ Q , since a smaller L θ Q represents a more accurate value given for states and actions. Therefore, the parameter θ Q of the evaluation network is updated by the gradient descent method, and the equation is shown as (4).…”
Section: Ddpg Learning Methodsmentioning
confidence: 99%
“…where A t = π θ u (S t ), and γ is the discount factor. The learning objective of the critic network is to minimize L θ Q , and the parameters θ Q of the critic network are updated by the gradient descent method, and the formula is shown in Equation (4).…”
Section: Multi-agent Interactive Learningmentioning
confidence: 99%
See 1 more Smart Citation
“…In [130], maximum distance separable codes are used for cooperative coded caching at small BSs in a scenario with time-varying and unknown content popularity. Aiming to address this challenge and guarantee reduced fronthaul load, a multi-agent DRL solution is presented for cache update.…”
Section: ) Spectral Efficiencymentioning
confidence: 99%
“…[17] also jointly considers the content caching and mode selection problem, which applies DRL for decision making of the cloud server to satisfy the requirements of slices. The multi-tenant radio resource allocation problem is investigated in [18], and DRL is leveraged to optimize the computation offloading and packet scheduling. A multi-agent DRL algorithm is proposed in [19] for the cooperative content caching of small base stations, and each agent can intelligently cache contents under time-varying popularity.…”
Section: Related Workmentioning
confidence: 99%