Abstract. Amazon recently estimated that the cost of energy for its datacenters reached 42% of the total cost of operation. Our previous research proposed an algorithm to predict how much cloud workload is expected at a specific time. This allows physical servers determined not to be needed to be placed in a low-power sleep state to save energy. If more system capacity is required, servers in a sleep state are transitioned back to an active state. In this paper, we extend our prior research by presenting both a stochastic model for state change as well as a new approach to determining the sampling frequency for performing the prediction of the expected capacity. The first result we show is that this allows the optimal prediction time horizon to be chosen. We next present a dynamic prediction quantization method to determine the optimal number of prediction calculation intervals. Both of these new algorithms allow us to predict future load within required Service Level Agreements while minimizing the number of prediction calculations. This effectively optimizes our ability to predict while minimizing the detrimental effect of additional calculations on our cloud resources. Finally, we test this model by simulating the stochastic time horizon and dynamic quantization algorithms and compare the results with three competing methods. We show that our model provides up to a 20% reduction in the number of calculations required while maintaining the given Service Level Agreement.Keywords: Cloud computing, Energy conservation, Green design, Optimal control, Prediction algorithm.
IntroductionIn 2006, it was estimated that the total energy consumption of data centers worldwide was equivalent to 1.5% of the energy consumed by the United States. This dollar amount represented over $4.5 billion, and was expected to double in a five year period [1]. For the operators of datacenters, the cost of energy is now over 42% of the total cost of operating the datacenter, and rising, according to Amazon [2]. This trend is accelerating due to the rapid adoption of cloud computing as the primary form of providing compute to users. Cloud computing is an architectural transition occurring in datacenters across the world. The traditional approach to datacenter hosting is to provide access to each physical machine (PM) to only one client. Increasingly, the traditional model of datacenter hosting is giving way to a more efficient model of computing, cloud computing. Cloud computing offers the hosting provider higher efficiency of operation by allowing