To leverage periodic disaster backup in a cloud data center (DC) network, previous studies employ disjoint unicast paths for bulk data transfers among multiple geographically distributed DCs, causing massive unnecessary traffic duplication. This not only adds the overhead but also may result in severe network congestion. With flexible network resource management in software-defined networks and powerful traffic aggregation capability of multicast, we propose capacity-constrained multicast to realize cost-efficient disaster backup. First, considering limited backup storage capacity and essential redundancy guarantee, we construct a capacity-constrained multicasting backup model. Then, we formulate the disaster backup problem as capacity-constrained multiple Steiner tree problem, which is NP-hard. To solve this problem, we design a new multicasting backup ant colony optimization algorithm based on requirement-aware growth. It directly optimizes every disaster-backup multicast tree (DBMT) from its root node to cover enough destination nodes guaranteeing sufficient redundancy and then expands them into the forest under the guidance of a multicast tree shared degree, the ratio of available storage capacity, and backup load distribution offset. We introduce unique edge fitness evaluation and pheromones for every DBMT to reduce mutual influences among multiple trees. Extensive simulations demonstrate that our strategy performs with less bandwidth consumption cost and relatively good backup load distribution fairness simultaneously.
KEYWORDScapacity-constrained, cost-efficient, disaster backup, multiple Steiner tree
INTRODUCTIONWith more and more frequent occurrence of natural disasters and human-made intentional attacks on application data centers (DCs), 1,2 a large number of data face the risk of failure. Therefore, people need to leverage periodic backup among multiple geographically distributed (geo-distributed) DCs. 3 To improve the fault-tolerant performance and obtain sufficient data redundancy, terabytes to petabytes of data produced during a period are regularly replicated and assigned to three or more other remote DCs, 4 called disaster backup. Due to the huge amount of data transfers, disaster backup activity always consumes huge bandwidth and imposes a heavy load to the links between DCs and may even cause network congestion. Therefore, to support quality-of-service requirements of various real-time applications in network, the traffic engineering problem for disaster backup among multiple DCs is of significant importance.It is noteworthy that in the research area of traffic engineering for multiple disaster-backup transfers, unicast routing has been studied widely, while multicast routing attracts much less attention. Multicast is designed to deliver contents to a large number of destinations and benefits DC group communications in three aspects. By saving network traffic, it can increase the throughput of bandwidth-hungry computations. By releasing the sender from sending multiple copies of packets to diffe...