2012
DOI: 10.1016/j.comnet.2012.06.001
|View full text |Cite
|
Sign up to set email alerts
|

A switch-based approach to throughput collapse and starvation in data centers

Abstract: Abstract-Data center switches need to satisfy stringent lowdelay and high-capacity requirements. To do so, they rely on small switch buffers. However, in case of congestion, data center switches can incur throughput collapse for short TCP flows as well as temporary starvation for long TCP flows.In this paper, we introduce a lightweight hash-based algorithm called HCF (Hashed Credits Fair) to solve these problems at the switch level while being transparent to the end users. We show that it can be readily implem… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2012
2012
2015
2015

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(8 citation statements)
references
References 22 publications
0
8
0
Order By: Relevance
“…On the contrary, the average loads of DCNs are not very high and most of the links are not always utilized at their full ca-timeout in TCP [6,7,11,13,[18][19][20][21][22][23][24]. The authors in [6] and [18] suggest modifying the IEEE 802.1Qau switch to regulate the flow rate so that the severe drop that causes RTO (Retransmission Timeout) is less likely to happen.…”
Section: Mptcp Incast Collapsementioning
confidence: 99%
See 1 more Smart Citation
“…On the contrary, the average loads of DCNs are not very high and most of the links are not always utilized at their full ca-timeout in TCP [6,7,11,13,[18][19][20][21][22][23][24]. The authors in [6] and [18] suggest modifying the IEEE 802.1Qau switch to regulate the flow rate so that the severe drop that causes RTO (Retransmission Timeout) is less likely to happen.…”
Section: Mptcp Incast Collapsementioning
confidence: 99%
“…The authors in [6] and [18] suggest modifying the IEEE 802.1Qau switch to regulate the flow rate so that the severe drop that causes RTO (Retransmission Timeout) is less likely to happen. Shpiner et al [19] proposed using two queues to prioritize flows with a lower packet arrival rate so that it could reduce throughput disparity and make TCP less likely to resort to RTO for recovery. Alizadeh et al [20] proposed DCTCP to allow senders to start reacting as soon as the queue length on an interface exceeds a thresh value so that more buffer space is available to absorb transient micro-bursts, greatly mitigating costly packet losses that can lead to timeouts.…”
Section: Mptcp Incast Collapsementioning
confidence: 99%
“…Both [30] and [31] suggest to modify a IEEE 802.1Qau [32] switch to regulate flow rates, so that severe drop that causes incast throughput collapse is less likely to happen. In [33], two queues are used to prioritize flows with lower packet arrival rate, so that it can reduce throughput disparity and make TCP less likely to resort to RTO for recovery.…”
Section: Previous Workmentioning
confidence: 99%
“…In [15], the authors explored how Quantized Congestion Notification (QCN) [16], an Ethernet layer congestion control mechanism, operated under TCP-incast scenario in data centers, and proposed some modifications to improve its efficiency. Shpiner and Keslassy [7] proposed a new architecture called Hashed Credits Fair (HCF) to avoid TCP throughput collapse in data centers. The key idea of the scheme is to serve the incoming traffic through two queues (high priority and low priority queues) instead of one traditional DropTail one.…”
Section: Influence Of the Timer Granularitymentioning
confidence: 99%
“…These approaches require changes at the transport level, or within the operating system (OS) kernel. The other potential solution is to rely on switch-based mechanisms within the network [7].…”
Section: Introductionmentioning
confidence: 99%