2015
DOI: 10.1109/tpds.2014.2316829
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting Efficient and Scalable Shuffle Transfers in Future Data Center Networks

Abstract: Distributed computing systems like MapReduce in data centers transfer massive amount of data across successive processing stages. Such shuffle transfers contribute most of the network traffic and make the network bandwidth become a bottleneck. In many commonly used workloads, data flows in such a transfer are highly correlated and aggregated at the receiver side. To lower down the network traffic and efficiently use the available network bandwidth, we propose to push the aggregation computation into the networ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
4
4

Relationship

1
7

Authors

Journals

citations
Cited by 34 publications
(8 citation statements)
references
References 28 publications
0
8
0
Order By: Relevance
“…In order to meet the increasing demands for services and better performance, the physical infrastructure should scale gracefully to accommodate concurrent jobs enabling incremental expansion without affecting the existing services. Correspondingly, the scheduling strategy should be scalable and can be easily adapted to the new expanded cloud services [18], [19].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In order to meet the increasing demands for services and better performance, the physical infrastructure should scale gracefully to accommodate concurrent jobs enabling incremental expansion without affecting the existing services. Correspondingly, the scheduling strategy should be scalable and can be easily adapted to the new expanded cloud services [18], [19].…”
Section: Related Workmentioning
confidence: 99%
“…where λ v and λ e are the recovery costs of virtual node v and virtual link e; and µvē = max{svē, dvē}. The cost for embedding e onto backbone network can be defined as in Formula (18).…”
Section: B the Objectivementioning
confidence: 99%
“…The algorithm firstly create a delay table and then iteratively find the delay cost between the adjacent nodes (lines 1-9). The sequence table is then created with all the delay paths between any two nodes and the minimum delay (lines [11][12][13][14][15][16][17][18]. In Algorithm 1, the complexity of all the iterations is O(|V|)…”
Section: Example 1 Considermentioning
confidence: 99%
“…Despite such bandwidth superiority, multicast has not been optimally utilized by the Internet during the past decades. Recently, it has achieved some successful network-level deployments in datacenter networks [9], enterprise networks and IPTV networks [10][11][12][13]. Research proposals dealing with the construction of multicasting trees that satisfy various constraints under single source scenario are abundant in the literature.…”
Section: Introductionmentioning
confidence: 99%
“…Data centers are dominating infrastructures that support various cloud computing applications [1] . To satisfy bandwidth demand, modern data centers depend heavily on fiber-optic links [2] .…”
Section: Introductionmentioning
confidence: 99%