2022
DOI: 10.1145/3512798.3512815
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement Learning for Datacenter Congestion Control

Abstract: We approach the task of network congestion control in datacenters using Reinforcement Learning (RL). Successful congestion control algorithms can dramatically improve latency and overall network throughput. Until today, no such learning-based algorithms have shown practical potential in this domain. Evidently, the most popular recent deployments rely on rule-based heuristics that are tested on a predetermined set of benchmarks. Consequently, these heuristics do not generalize well to newly-seen scenarios. Cont… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

1
14
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 14 publications
(15 citation statements)
references
References 18 publications
1
14
0
Order By: Relevance
“…Another crucial component in the success of ADPG is its RTT-based reward that at its optimum depicts an optimal flow equilibrium. In line with our summary above, Tessler et al [44] state that for potential deployment, they would require dedicated hardware to accommodate the computational burden of deep-learning inference. In this work, we build upon [44] and devise a reward-modified variant which we call RL-CC.…”
Section: Ai-based CCmentioning
confidence: 57%
See 4 more Smart Citations
“…Another crucial component in the success of ADPG is its RTT-based reward that at its optimum depicts an optimal flow equilibrium. In line with our summary above, Tessler et al [44] state that for potential deployment, they would require dedicated hardware to accommodate the computational burden of deep-learning inference. In this work, we build upon [44] and devise a reward-modified variant which we call RL-CC.…”
Section: Ai-based CCmentioning
confidence: 57%
“…Recently, Tessler et al [44] introduced an RL-based RDMA CC algorithm called Analytic Deterministic Policy Gradient (ADPG). In several network simulation benchmarks, ADPG outperformed SOTA rule-based CC algorithms: DCQCN, SWIFT, and HPCC.…”
Section: Ai-based CCmentioning
confidence: 99%
See 3 more Smart Citations